PART I


PRELIMINARY NOTE; CHAPTERS ON 
DETERMINANTS 

PRELIMINARY NOTE 

1. Number 

In its initial stages algebra is little more than a generaliza- 
tion of elementary arithmetic. It deals only with the positive 
integers, 1, 2, 3, ... . We can all remember the type of problem
that began ‘let x be the number of eggs’, and if x came to 3½ we
knew we were wrong.

In later stages x is permitted to be negative or zero, to be the
ratio of two integers, and then to be any real number either
rational, such as 3½ or −½, or irrational, such as π or √3. Finally,
with the solution of the quadratic equation, x is permitted to be
a complex number, such as 2+3i.

The numbers used in this book may be either real or complex 
and we shall assume that readers have studied, to a greater or 
a lesser extent, the precise definitions of these numbers and 
the rules governing their addition, subtraction, multiplication, 
and division. 

2. Number rings 

Consider the set of numbers 

\[ 0, \quad \pm 1, \quad \pm 2, \quad \ldots. \tag{1} \]

Let r, s denote numbers selected from (1). Then, whether r and
s denote the same or different numbers, the numbers

\[ r+s, \quad r-s, \quad r \times s \]

all belong to (1). This property of the set (1) is shared by other
sets of numbers. For example,

all numbers of the form $a+b\sqrt{5}$,   (2)

where a and b belong to (1), have the same property; if r and s
belong to (2), then so do r+s, r−s, and r×s. A set of numbers
having this property is called a ring of numbers.
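The closure property of the set (2) can be checked mechanically. The following is only an illustrative sketch: the pair representation `(a, b)` for a+b√5 and the function names are assumptions made here, not notation from the text.

```python
from itertools import product

# An element a + b*sqrt(5), with a and b integers, is stored as the pair (a, b).

def add(r, s):
    return (r[0] + s[0], r[1] + s[1])

def sub(r, s):
    return (r[0] - s[0], r[1] - s[1])

def mul(r, s):
    # (a + b√5)(c + d√5) = (ac + 5bd) + (ad + bc)√5
    a, b = r
    c, d = s
    return (a * c + 5 * b * d, a * d + b * c)

def in_ring(t):
    # Membership of (2): both coefficients must be integers.
    return all(isinstance(x, int) for x in t)

# Try every pair of elements drawn from a small sample of the set (2).
sample = [(a, b) for a, b in product(range(-2, 3), repeat=2)]
closed = all(in_ring(op(r, s)) for r in sample for s in sample for op in (add, sub, mul))
```

The multiplication rule is the only step that needs any thought: the product of two numbers of the form (2) has integer coefficients again because (√5)² = 5 is itself an integer.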

3. Number fields

3.1. Consider the set of numbers comprising 0 and every
number of the form p/q, where p and q belong to (1) and q is not
zero, that is to say,

the set of all rational real numbers.   (3)

Let r, s denote numbers selected from (3). Then, when s is not
zero, whether r and s denote the same or different numbers,
the numbers

\[ r+s, \quad r-s, \quad r \times s, \quad r \div s \]

all belong to the set (3).

This property characterizes what is called a field of numbers. 
The property is shared by the following sets, among many
others:

the set of all complex numbers;   (4)

the set of all real numbers (rational and irrational);   (5)

the set of all numbers of the form $p+q\sqrt{3}$, where p and
q belong to (3).   (6)

Each of the sets (4), (5), and (6) constitutes a field. 


Definition. A set of numbers, real or complex, is said to form a
field of numbers when, if r and s belong to the set and s is not
zero,

\[ r+s, \quad r-s, \quad r \times s, \quad r \div s \]

also belong to the set.

Notice that the set (1) is not a field; for, whereas it contains
the numbers 1 and 2, it does not contain the number ½.
3.2. Most of the propositions in this book presuppose that 
the work is carried out within a field of numbers; what particular 
field is usually of little consequence. 

In the early part of the book this aspect of the matter need 
not be emphasized: in some of the later chapters the essence of 
the theorem is that all the operations envisaged by the theorem 
can be carried out within the confines of any given field of 
numbers. 

In this preliminary note we wish to do no more than give a 
formal definition of a field of numbers and to familiarize the
reader with the concept. 

4. Matrices 

A set of mn numbers, real or complex, arranged in an array of 
m columns and n rows is called a matrix. Thus 


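\[
\begin{bmatrix}
a_{11} & a_{12} & \cdots & a_{1m} \\
a_{21} & a_{22} & \cdots & a_{2m} \\
\vdots & \vdots &        & \vdots \\
a_{n1} & a_{n2} & \cdots & a_{nm}
\end{bmatrix}
\]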


is a matrix. When m = n we speak of a square matrix of order n. 

Associated with any given square matrix of order n there are 
a number of algebraical entities. The matrix written above, 
with m = n, is associated 

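(i) with the determinant

\[
\begin{vmatrix}
a_{11} & a_{12} & \cdots & a_{1n} \\
a_{21} & a_{22} & \cdots & a_{2n} \\
\vdots & \vdots &        & \vdots \\
a_{n1} & a_{n2} & \cdots & a_{nn}
\end{vmatrix};
\]

(ii) with the form

\[ \sum_{r=1}^{n} \sum_{s=1}^{n} a_{rs} x_r x_s \]

of degree 2 in the n variables $x_1, x_2, \ldots, x_n$;

(iii) with the bilinear form

\[ \sum_{r=1}^{n} \sum_{s=1}^{n} a_{rs} x_r y_s \]

in the 2n variables $x_1, \ldots, x_n$ and $y_1, \ldots, y_n$;

(iv) with the Hermitian form

\[ \sum_{r=1}^{n} \sum_{s=1}^{n} a_{rs} x_r \bar{x}_s, \]

where $x_s$ and $\bar{x}_s$ are conjugate complex numbers;

(v) with the linear transformations

\[ X_r = \sum_{s=1}^{n} a_{rs} x_s \quad (r = 1, \ldots, n), \]

\[ L_s = \sum_{r=1}^{n} a_{rs} l_r \quad (s = 1, \ldots, n). \]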
The theories of the matrix and of its associated forms are
closely knit together. The plan of expounding these theories
that I have adopted is, roughly, this: Part I develops properties
of the determinant; Part II develops the algebra of matrices,
referring back to Part I for any result about determinants that
may be needed; Part III develops the theory of the other
associated forms.



CHAPTER I 


ELEMENTARY PROPERTIES OF DETERMINANTS 

1. Introduction

1.1. In the following chapters it is assumed that most
readers will already be familiar with determinants of the second 
and third orders. On the other hand, no theorems about such 
determinants are assumed, so that the account given here is 
complete in itself. 

Until the middle of the last century the use of determinant 
notation was practically unknown, but once introduced it 
gained such popularity that it is now employed in almost every 
branch of mathematics. The theory has been developed to such 
an extent that few mathematicians would pretend to a know- 
ledge of the whole of it. On the other hand, the range of theory 
that is of constant application in other branches of mathematics
is relatively small, and it is this restricted range that the book 
covers. 

1.2. Determinants of the second and third orders. 

Determinants are, in origin, closely connected with the solu- 
tion of linear equations. 

Suppose that the two equations

\[ a_1 x + b_1 y = 0, \qquad a_2 x + b_2 y = 0 \]

are satisfied by a pair of numbers x and y, one of them, at least,
being different from zero. Then

\[ b_2(a_1 x + b_1 y) - b_1(a_2 x + b_2 y) = 0, \]

and so $(a_1 b_2 - a_2 b_1)x = 0$.

Similarly, $(a_1 b_2 - a_2 b_1)y = 0$, and so, since x and y are not
both zero, $a_1 b_2 - a_2 b_1 = 0$.

The number $a_1 b_2 - a_2 b_1$ is a simple example of a determinant;
it is usually written as

\[ \begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix}. \tag{1} \]

The term $a_1 b_2$ is referred to as ‘the leading diagonal’. Since
there are two rows and two columns, the determinant is said
to be ‘of order two’, or ‘of the second order’.




The determinant has one obvious property. If, in (1), we
interchange simultaneously $a_1$ and $b_1$, $a_2$ and $b_2$, we get
$b_1 a_2 - b_2 a_1$ instead of $a_1 b_2 - a_2 b_1$.

That is, the interchange of two columns of (1) reproduces the
same terms, namely $a_1 b_2$ and $a_2 b_1$, but in a different order and
with the opposite signs.

Again, let numbers x, y, and z, not all zero, satisfy the three
equations

\[ a_1 x + b_1 y + c_1 z = 0, \tag{2} \]

\[ a_2 x + b_2 y + c_2 z = 0, \tag{3} \]

\[ a_3 x + b_3 y + c_3 z = 0; \tag{4} \]

then, from equations (3) and (4),

\[ (a_2 b_3 - a_3 b_2)x - (b_2 c_3 - b_3 c_2)z = 0, \]

\[ (a_2 b_3 - a_3 b_2)y - (c_2 a_3 - c_3 a_2)z = 0, \]

and so, from equation (2),

\[ z\{a_1(b_2 c_3 - b_3 c_2) + b_1(c_2 a_3 - c_3 a_2) + c_1(a_2 b_3 - a_3 b_2)\} = 0. \]

We denote the coefficient of z, which may be written as

\[ a_1 b_2 c_3 - a_1 b_3 c_2 + a_2 b_3 c_1 - a_2 b_1 c_3 + a_3 b_1 c_2 - a_3 b_2 c_1, \]

by Δ; so that our result is zΔ = 0.

By similar working we can show that

\[ x\Delta = 0, \qquad y\Delta = 0. \]

Since x, y, and z are not all zero, Δ = 0.

The number Δ is usually written as

\[
\begin{vmatrix}
a_1 & b_1 & c_1 \\
a_2 & b_2 & c_2 \\
a_3 & b_3 & c_3
\end{vmatrix}
\]

in which form it is referred to as a ‘determinant of order three’
or a ‘determinant of the third order’. The term $a_1 b_2 c_3$ is
referred to as ‘the leading diagonal’.
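The argument of § 1.2 can be tried on numbers. The sketch below evaluates the six-term expression for a set of coefficients chosen (as an assumption of this example) so that x, y, z = 1, 1, −1 satisfies all three equations; the function name `det3` is likewise introduced only for illustration.

```python
def det3(a, b, c):
    """Six-term expansion; a, b, c are the columns (a1, a2, a3), (b1, b2, b3), (c1, c2, c3)."""
    a1, a2, a3 = a
    b1, b2, b3 = b
    c1, c2, c3 = c
    return (a1 * b2 * c3 - a1 * b3 * c2
            + a2 * b3 * c1 - a2 * b1 * c3
            + a3 * b1 * c2 - a3 * b2 * c1)

# Coefficients chosen so that x, y, z = 1, 1, -1 satisfies every equation.
a = (1, 2, 4)
b = (2, 5, -1)
c = (3, 7, 3)
x, y, z = 1, 1, -1

# Left-hand sides of the three equations; each should vanish.
residuals = [a[i] * x + b[i] * y + c[i] * z for i in range(3)]

# Since a solution with x, y, z not all zero exists, the determinant must vanish.
delta = det3(a, b, c)
```

For coefficients that admit only the solution x = y = z = 0, such as those of the unit matrix, the same function returns a non-zero value.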

1.3. It is thus suggested that, associated with n linear
equations in n variables, say

\[
\begin{aligned}
a_1 x + b_1 y + \cdots + k_1 z &= 0, \\
a_2 x + b_2 y + \cdots + k_2 z &= 0, \\
&\;\;\vdots \\
a_n x + b_n y + \cdots + k_n z &= 0,
\end{aligned}
\]

there is a certain function of the coefficients which must be
zero if all the equations are to be satisfied by a set of values
x, y, ..., z which are not all zero. It is suggested that this
function of the coefficients may be conveniently denoted by

\[
\begin{vmatrix}
a_1 & b_1 & \cdots & k_1 \\
a_2 & b_2 & \cdots & k_2 \\
\vdots & \vdots &  & \vdots \\
a_n & b_n & \cdots & k_n
\end{vmatrix},
\]

in which form it may be referred to as a determinant of order n,
and $a_1 b_2 \ldots k_n$ called the leading diagonal.

Just as we formed a determinant of order three (in § 1.2) by
using determinants of order two, so we could form a determinant
of order four by using those of order three, and proceed
step by step to a definition of a determinant of order n. But this
is not the only possible procedure and we shall arrive at our
definition by another path.

We shall first observe certain properties of determinants of
the third order and then define a determinant of order n in
such a way that these properties are preserved for determinants
of every order.

1.4. Note on definitions. There are many different ways of defining
a determinant of order n, though all the definitions lead to the same
result in the end. The only particular merit we claim for our own
definition is that it is easily reconcilable with any of the others, and so makes
reference to other books a simple matter.

1.5. Properties of determinants of order three. As we
have seen in § 1.2, the determinant

\[
\begin{vmatrix}
a_1 & b_1 & c_1 \\
a_2 & b_2 & c_2 \\
a_3 & b_3 & c_3
\end{vmatrix}
\]

stands for the expression

\[ +a_1 b_2 c_3 - a_1 b_3 c_2 + a_2 b_3 c_1 - a_2 b_1 c_3 + a_3 b_1 c_2 - a_3 b_2 c_1. \tag{1} \]

The following facts are all but self-evident: 

(I) The expression (1) is of the form

\[ \sum \pm a_r b_s c_t, \]

wherein the sum is taken over the six possible ways of assigning
to r, s, t the values 1, 2, 3 in some order and without repetition.

(II) The leading diagonal term $a_1 b_2 c_3$ is prefixed by +.

(III) As with the determinant of order 2 (§ 1.2), the interchange
of any two letters throughout the expression (1) reproduces
the same set of terms, but in a different order and with
the opposite signs prefixed to them. For example, when a and b
are interchanged† in (1), we get

\[ +b_1 a_2 c_3 - b_1 a_3 c_2 + b_2 a_3 c_1 - b_2 a_1 c_3 + b_3 a_1 c_2 - b_3 a_2 c_1, \]

which consists of the terms of (1), but in a different order and
with the opposite signs prefixed.

2. Determinants of order n

2.1. Having observed (§ 1.5) three essential properties of a
determinant of the third order, we now define a determinant
of order n.

Definition. The determinant

\[
\Delta_n =
\begin{vmatrix}
a_1 & b_1 & \cdots & j_1 & k_1 \\
a_2 & b_2 & \cdots & j_2 & k_2 \\
\vdots & \vdots &  & \vdots & \vdots \\
a_n & b_n & \cdots & j_n & k_n
\end{vmatrix}
\tag{1}
\]

is that function of the a's, b's, ..., k's which satisfies the three
conditions:

(I) it is an expression of the form

\[ \sum \pm a_r b_s \ldots k_\theta, \tag{2} \]

wherein the sum is taken over the n! possible ways of assigning
to r, s, ..., θ the values 1, 2, ..., n in some order, and without
repetition;

(II) the leading diagonal term, $a_1 b_2 \ldots k_n$, is prefixed by the
sign +;

(III) the sign prefixed to any other term is such that the interchange
of any two letters‡ throughout (2) reproduces the same
set of terms, but in a different order of occurrence, and with
the opposite signs prefixed.

† Throughout we use the phrase ‘interchange a and b’ to denote the
simultaneous interchanges $a_1$ and $b_1$, $a_2$ and $b_2$, $a_3$ and $b_3$, ..., $a_n$ and $b_n$.

‡ See previous footnote. The interchange of p and q, say, means the
simultaneous interchanges $p_1$ and $q_1$, $p_2$ and $q_2$, ..., $p_n$ and $q_n$.

Before proceeding we must prove that the definition yields
one function of the a's, b's, ..., k's and one only. The proof that
follows is divided into four main steps.

First step. Let the letters a, b, ..., k, which correspond to the
columns of (1), be written down in any order, say

..., d, g, a, ..., p, ..., q, ....   (A)

An interchange of two letters that stand next to each other is
called an adjacent interchange. Take any two letters p and
q, having, say, m letters between them in the order (A). By
m+1 adjacent interchanges, in each of which p is moved one
place to the right, we reach a stage at which p comes next
after q; by m further adjacent interchanges, in each of which q
is moved one place to the left, we reach a stage at which the
order (A) is reproduced save that p and q have changed places.
This stage has been reached by means of 2m+1 adjacent
interchanges.

Now if, in (2), we change all the signs 2m+1 times, we end
with signs opposite to our initial signs. Accordingly, if the
condition (III) of the definition is satisfied for adjacent interchanges
of letters, it is automatically satisfied for every interchange
of letters.
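The counting in the first step can be traced in code. The following is a sketch under assumed names (`swap_by_adjacent` is hypothetical): it interchanges two letters using adjacent interchanges only and counts how many were needed.

```python
def swap_by_adjacent(order, p, q):
    """Interchange p and q in `order` by adjacent interchanges; return (new_order, count)."""
    seq = list(order)
    i, j = seq.index(p), seq.index(q)
    if i > j:
        i, j = j, i
    start = i
    count = 0
    # Move the left-hand letter to the right until it stands just after the
    # other letter: m + 1 adjacent interchanges, where m = j - start - 1.
    while i < j:
        seq[i], seq[i + 1] = seq[i + 1], seq[i]
        i += 1
        count += 1
    # The other letter is now at index j - 1; move it left to the vacated
    # place: m further adjacent interchanges.
    k = j - 1
    while k > start:
        seq[k], seq[k - 1] = seq[k - 1], seq[k]
        k -= 1
        count += 1
    return seq, count

# b and f have m = 3 letters between them, so 2m + 1 = 7 interchanges are expected.
new_order, count = swap_by_adjacent("abcdef", "b", "f")
```

The count is always odd, which is the whole point of the step: an odd number of sign changes is the same as one sign change.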

Second step. The conditions (I), (II), (III) fix the value of
the determinant (of the second order)

\[ \begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix} \]

to be $a_1 b_2 - a_2 b_1$. For, by (I) and (II), the value must be

\[ +a_1 b_2 \pm a_2 b_1, \]

and, by (III), the interchange of a and b must change the signs,
so that we cannot have $a_1 b_2 + a_2 b_1$.

Third step. Assume, then, that the conditions (I), (II), (III)
are sufficient to fix the value of a determinant of order n − 1.
By (I), the determinant $\Delta_n$ contains a set of terms in which a
has the suffix 1; this set of terms is

\[ a_1 \sum \pm b_s c_t \ldots k_\theta, \tag{3} \]

wherein, by (I), as applied to $\Delta_n$,

(i) the sum is taken over the (n − 1)! possible ways of
assigning to s, t, ..., θ the values 2, 3, ..., n in some order
and without repetition.

Moreover, by (II), as applied to $\Delta_n$,

(ii) the term $b_2 c_3 \ldots k_n$ is prefixed by +.

Finally, by (III), as applied to $\Delta_n$,

(iii) an interchange of any two of the letters b, c, ..., k changes
the signs throughout (3).

Hence, by our hypothesis that the conditions (I), (II), (III)
fix the value of a determinant of order n − 1, the terms of (2) in
which a has the suffix 1 are given by

\[
a_1
\begin{vmatrix}
b_2 & c_2 & \cdots & k_2 \\
b_3 & c_3 & \cdots & k_3 \\
\vdots & \vdots &  & \vdots \\
b_n & c_n & \cdots & k_n
\end{vmatrix}.
\tag{3a}
\]

This, on our assumption that a determinant of order n − 1 is
defined by the conditions (I), (II), (III), fixes the signs of all
terms in (2) that contain $a_1 b_2, a_1 b_3, \ldots, a_1 b_n$.

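Fourth step. The interchange of a and b in (2) must, by
condition (III), change all the signs in (2). Hence the terms of
(2) in which b has the suffix 1 are given by

\[
-b_1
\begin{vmatrix}
a_2 & c_2 & \cdots & k_2 \\
a_3 & c_3 & \cdots & k_3 \\
\vdots & \vdots &  & \vdots \\
a_n & c_n & \cdots & k_n
\end{vmatrix};
\tag{3b}
\]

for (3b) fixes the sign of a term $b_1 a_s c_t \ldots k_\theta$ to be the opposite of
the sign of the term $a_1 b_s c_t \ldots k_\theta$ in (3a).

The adjacent interchanges b with c, c with d, ..., j with k
now show that (2) must take the form

\[
a_1
\begin{vmatrix}
b_2 & c_2 & \cdots & k_2 \\
b_3 & c_3 & \cdots & k_3 \\
\vdots & \vdots &  & \vdots \\
b_n & c_n & \cdots & k_n
\end{vmatrix}
- b_1
\begin{vmatrix}
a_2 & c_2 & \cdots & k_2 \\
a_3 & c_3 & \cdots & k_3 \\
\vdots & \vdots &  & \vdots \\
a_n & c_n & \cdots & k_n
\end{vmatrix}
+ c_1
\begin{vmatrix}
a_2 & b_2 & \cdots & k_2 \\
a_3 & b_3 & \cdots & k_3 \\
\vdots & \vdots &  & \vdots \\
a_n & b_n & \cdots & k_n
\end{vmatrix}
- \cdots + (-1)^{n-1} k_1
\begin{vmatrix}
a_2 & b_2 & \cdots & j_2 \\
a_3 & b_3 & \cdots & j_3 \\
\vdots & \vdots &  & \vdots \\
a_n & b_n & \cdots & j_n
\end{vmatrix}.
\]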


That is to say, if conditions (I), (II), (III) define uniquely a 
determinant of order n— 1 , then they define uniquely a deter- 
minant of order n. But they do define uniquely a determinant 
of order 2 , and hence, by induction, they define uniquely a 
determinant of any order. 

2.2. Rule for determining the sign of a given term.

If in a term $a_r b_s \ldots k_\theta$ there are $\lambda_n$ suffixes less than n that
come after n, we say that there are $\lambda_n$ inversions with respect
to n. For example, in the term $a_2 b_3 c_4 d_1$, there is one inversion
with respect to 4. Similarly, if there are $\lambda_{n-1}$ suffixes less than
n − 1 that come after n − 1, we say that there are $\lambda_{n-1}$ inversions
with respect to n − 1; and so on. The sum

\[ N = \lambda_1 + \lambda_2 + \cdots + \lambda_n \]

is called the total number of inversions of suffixes. Thus, with
n = 6 and the term

\[ a_4 b_3 c_2 d_6 e_1 f_5, \tag{5} \]

$\lambda_6 = 2$, since the suffixes 1 and 5 come after 6,

$\lambda_5 = 0$, since no suffix less than 5 comes after 5,

$\lambda_4 = 3$, since the suffixes 3, 2, 1 come after 4,

$\lambda_3 = 2$, $\lambda_2 = 1$, $\lambda_1 = 0$;

the total number of inversions is 2 + 3 + 2 + 1 = 8.

If $a_r b_s \ldots k_\theta$ has $\lambda_n$ inversions with respect to n, then, leaving
the order of the suffixes 1, 2, ..., n − 1 unchanged, we can make
n to be the suffix of the nth letter of the alphabet by $\lambda_n$ adjacent
interchanges of letters and, on restoring alphabetical order,
make n the last suffix. For example, in (5), where $\lambda_n = 2$, the
two adjacent interchanges f with e and f with d give, in
succession,

\[ a_4 b_3 c_2 d_6 f_1 e_5, \qquad a_4 b_3 c_2 f_6 d_1 e_5. \]

On restoring alphabetical order in the last form, we have
$a_4 b_3 c_2 d_1 e_5 f_6$, in which the suffixes 4, 3, 2, 1, 5 are in their
original order, as in (5), and the suffix 6 comes at the end.

Similarly, when n has been made the last suffix, $\lambda_{n-1}$ adjacent
interchanges of letters followed by a restoration of alphabetical
order will then make n − 1 the (n − 1)th suffix; and so on.

Thus $\lambda_1 + \lambda_2 + \cdots + \lambda_n$ adjacent interchanges of letters make
the term $a_r b_s \ldots k_\theta$ coincide with $a_1 b_2 \ldots k_n$. By (III), the sign
to be prefixed to any term of (2) is $(-1)^N$, where N, i.e.
$\lambda_1 + \lambda_2 + \cdots + \lambda_n$, is the total number of inversions of suffixes.

2.3. The number N may also be arrived at in another way.
Let 1 ≤ m ≤ n. In the term

\[ a_r b_s \ldots k_\theta \]

let there be $\mu_m$ suffixes greater than m that come before m.
Then the suffix m comes after each of these $\mu_m$ greater suffixes
and, in evaluating N, accounts for one inversion with respect
to each of them. It follows that

\[ N = \sum_{m=1}^{n} \mu_m. \tag{6} \]
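The two ways of counting N in §§ 2.2 and 2.3 can be compared on the worked example. This sketch assumes the suffix order 4, 3, 2, 6, 1, 5 for the term (5), which is the order consistent with the inversion counts stated in § 2.2; the function names are illustrative only.

```python
def inversions_after(suffixes):
    """For each m: the number of suffixes less than m that come after m (the count of section 2.2)."""
    pos = {m: i for i, m in enumerate(suffixes)}
    return {m: sum(1 for x in suffixes[pos[m] + 1:] if x < m) for m in suffixes}

def inversions_before(suffixes):
    """For each m: the number of suffixes greater than m that come before m (the count of section 2.3)."""
    pos = {m: i for i, m in enumerate(suffixes)}
    return {m: sum(1 for x in suffixes[:pos[m]] if x > m) for m in suffixes}

term = [4, 3, 2, 6, 1, 5]            # suffixes of the letters a, b, c, d, e, f in order
after = inversions_after(term)
before = inversions_before(term)

N = sum(after.values())              # total number of inversions of suffixes
sign = (-1) ** N                     # sign prefixed to the term
```

Both dictionaries sum to the same N because each inverted pair of suffixes is counted exactly once by either rule: once at its larger member in § 2.2, once at its smaller member in § 2.3.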


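3. Properties of a determinant

3.1. Theorem 1. The determinant $\Delta_n$ of § 2.1 can be expanded
in either of the forms

(i) \[ \sum (-1)^N a_r b_s \ldots k_\theta, \]

where N is the total number of inversions in the suffixes r, s, ..., θ;

(ii) \[
a_1
\begin{vmatrix}
b_2 & c_2 & \cdots & k_2 \\
b_3 & c_3 & \cdots & k_3 \\
\vdots & \vdots &  & \vdots \\
b_n & c_n & \cdots & k_n
\end{vmatrix}
- b_1
\begin{vmatrix}
a_2 & c_2 & \cdots & k_2 \\
a_3 & c_3 & \cdots & k_3 \\
\vdots & \vdots &  & \vdots \\
a_n & c_n & \cdots & k_n
\end{vmatrix}
+ \cdots + (-1)^{n-1} k_1
\begin{vmatrix}
a_2 & b_2 & \cdots & j_2 \\
a_3 & b_3 & \cdots & j_3 \\
\vdots & \vdots &  & \vdots \\
a_n & b_n & \cdots & j_n
\end{vmatrix}.
\]

This theorem has been proved in § 2.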
Theorem 2. A determinant is unaltered in value when rows
and columns are interchanged; that is to say

\[
\begin{vmatrix}
a_1 & b_1 & \cdots & k_1 \\
a_2 & b_2 & \cdots & k_2 \\
\vdots & \vdots &  & \vdots \\
a_n & b_n & \cdots & k_n
\end{vmatrix}
=
\begin{vmatrix}
a_1 & a_2 & \cdots & a_n \\
b_1 & b_2 & \cdots & b_n \\
\vdots & \vdots &  & \vdots \\
k_1 & k_2 & \cdots & k_n
\end{vmatrix}.
\]

By Theorem 1, the second determinant is

\[ \sum (-1)^M \alpha_1 \beta_2 \ldots \kappa_n, \tag{7} \]

where α, β, ..., κ are the letters a, b, ..., k in some order and M is
the total number of inversions of letters.

[There are $\mu_n$ inversions with respect to k in αβ...κ if there
are $\mu_n$ letters after k that come before k in the alphabet; and so
on: $\sum \mu_m$ is the total number of inversions of letters.]

Now consider any one term of (7), say

\[ (-1)^M \alpha_1 \beta_2 \ldots \kappa_n. \tag{8} \]

If we write the product with its letters in alphabetical order,
we get a term of the form

\[ (-1)^M a_r b_s \ldots k_\theta. \tag{9} \]

In (8) there are $\mu_n$ letters that come after k, so that in (9) there
are $\mu_n$ suffixes greater than θ that come before θ. There are
$\mu_{n-1}$ letters that come before j in the alphabet but after it in
(8), so there are $\mu_{n-1}$ suffixes greater than the suffix of j that come before
it in (9); and so on. It follows from § 2.3 that M, which is
defined as $\sum \mu_m$, is equal to N, where N is the total number of
inversions of suffixes in (9).

Thus (7), which is the expansion of the second determinant
of the enunciation, may also be written as

\[ \sum (-1)^N a_r b_s \ldots k_\theta, \]

which is, by Theorem 1, the expansion of the first determinant
of the enunciation; and Theorem 2 is proved.
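Theorem 2 is easy to spot-check numerically. The sketch below evaluates a determinant directly from the signed sum of form (i) of Theorem 1 and compares a matrix with its transpose; the function `det` and the sample matrix are assumptions of this illustration, and the method is practical only for small orders since it visits all n! terms.

```python
from itertools import permutations

def det(rows):
    """Signed sum over all assignments of distinct suffixes to the columns."""
    n = len(rows)
    total = 0
    for perm in permutations(range(n)):
        # perm[c] is the row supplying the element taken from column c,
        # i.e. the suffix attached to the c-th letter.
        inversions = sum(1 for i in range(n) for j in range(i + 1, n) if perm[i] > perm[j])
        term = 1
        for col, row in enumerate(perm):
            term *= rows[row][col]
        total += (-1) ** inversions * term
    return total

A = [[2, -1, 0, 3],
     [1, 4, 7, -2],
     [5, 0, 6, 1],
     [-3, 2, 8, 4]]

# Rows and columns interchanged.
A_t = [list(col) for col in zip(*A)]
```

Here `det(A)` and `det(A_t)` agree, as the theorem asserts for every square array.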

Theorem 3. The interchange of two columns, or of two rows,
in a determinant multiplies the value of the determinant by −1.

It follows at once from (III) of the definition of $\Delta_n$ that an
interchange of two columns, i.e. an interchange of two letters,
multiplies the determinant by −1.
Hence also, by Theorem 2, an interchange of two rows
multiplies the determinant by −1.

Corollary. If a column (row) is moved past an even number
of columns (rows), then the value of the determinant is unaltered;
in particular

\[
\begin{vmatrix}
a_1 & b_1 & c_1 & d_1 \\
a_2 & b_2 & c_2 & d_2 \\
a_3 & b_3 & c_3 & d_3 \\
a_4 & b_4 & c_4 & d_4
\end{vmatrix}
=
\begin{vmatrix}
c_1 & a_1 & b_1 & d_1 \\
c_2 & a_2 & b_2 & d_2 \\
c_3 & a_3 & b_3 & d_3 \\
c_4 & a_4 & b_4 & d_4
\end{vmatrix}.
\]

If a column (row) is moved past an odd number of columns
(rows), then the value of the determinant is thereby multiplied
by −1.

For a column can be moved past an even (odd) number of
columns by an even (odd) number of adjacent interchanges.
In the particular example, abcd can be changed into cabd
by first interchanging b and c, giving acbd, and then
interchanging a and c.

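3.11. The expansion (ii) of Theorem 1 is usually referred to
as the expansion by the first row. By the corollary of Theorem
3, there is a similar expansion by any other row. For example,

\[
\begin{vmatrix}
a_1 & b_1 & c_1 & d_1 \\
a_2 & b_2 & c_2 & d_2 \\
a_3 & b_3 & c_3 & d_3 \\
a_4 & b_4 & c_4 & d_4
\end{vmatrix}
=
\begin{vmatrix}
a_3 & b_3 & c_3 & d_3 \\
a_1 & b_1 & c_1 & d_1 \\
a_2 & b_2 & c_2 & d_2 \\
a_4 & b_4 & c_4 & d_4
\end{vmatrix}
\]

and, on expanding the second determinant by its first row, the
first determinant is seen to be equal to

\[
a_3
\begin{vmatrix}
b_1 & c_1 & d_1 \\
b_2 & c_2 & d_2 \\
b_4 & c_4 & d_4
\end{vmatrix}
- b_3
\begin{vmatrix}
a_1 & c_1 & d_1 \\
a_2 & c_2 & d_2 \\
a_4 & c_4 & d_4
\end{vmatrix}
+ c_3
\begin{vmatrix}
a_1 & b_1 & d_1 \\
a_2 & b_2 & d_2 \\
a_4 & b_4 & d_4
\end{vmatrix}
- d_3
\begin{vmatrix}
a_1 & b_1 & c_1 \\
a_2 & b_2 & c_2 \\
a_4 & b_4 & c_4
\end{vmatrix}.
\]

Similarly, we may show that the first determinant may be
written as

\[
-a_2
\begin{vmatrix}
b_1 & c_1 & d_1 \\
b_3 & c_3 & d_3 \\
b_4 & c_4 & d_4
\end{vmatrix}
+ b_2
\begin{vmatrix}
a_1 & c_1 & d_1 \\
a_3 & c_3 & d_3 \\
a_4 & c_4 & d_4
\end{vmatrix}
- c_2
\begin{vmatrix}
a_1 & b_1 & d_1 \\
a_3 & b_3 & d_3 \\
a_4 & b_4 & d_4
\end{vmatrix}
+ d_2
\begin{vmatrix}
a_1 & b_1 & c_1 \\
a_3 & b_3 & c_3 \\
a_4 & b_4 & c_4
\end{vmatrix}.
\]

Again, by Theorem 2, we may turn columns into rows and
rows into columns without changing the value of the determinant.
Hence there are corresponding expansions of the
determinant by each of its columns.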

We shall return to this point in Chapter II, § 1. 

3.2. Theorem 4. If a determinant has two columns, or two
rows, identical, its value is zero.

The interchange of the two identical columns, or rows,
leaves its value unaltered. But, by Theorem 3, if its value is x,
its value after the interchange of the two columns, or rows,
is −x. Hence x = −x, or 2x = 0.

Theorem 5. If each element of one column, or row, is
multiplied by a factor K, the value of the determinant is thereby
multiplied by K.

This is an immediate corollary of the definition, for
\[ \sum \pm K a_r b_s \ldots k_\theta = K \sum \pm a_r b_s \ldots k_\theta. \]
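Theorems 3, 4 and 5 can each be observed on a small numerical determinant. The following is a sketch only: the helper `det`, the sample array and the factor K = 7 are choices made for this illustration.

```python
from itertools import permutations

def det(rows):
    """Signed sum of Theorem 1 (i); suitable only for small orders."""
    n = len(rows)
    total = 0
    for perm in permutations(range(n)):
        inv = sum(1 for i in range(n) for j in range(i + 1, n) if perm[i] > perm[j])
        term = 1
        for col, row in enumerate(perm):
            term *= rows[row][col]
        total += (-1) ** inv * term
    return total

A = [[2, 1, 3],
     [0, -1, 4],
     [5, 2, 1]]
d = det(A)

swapped = [[r[1], r[0], r[2]] for r in A]     # Theorem 3: first two columns interchanged
repeated = [[r[0], r[0], r[2]] for r in A]    # Theorem 4: two identical columns
scaled = [[7 * r[0], r[1], r[2]] for r in A]  # Theorem 5: first column multiplied by K = 7
```

With these arrays, `det(swapped)` is −d, `det(repeated)` is zero and `det(scaled)` is 7d.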


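3.3. Theorem 6. The determinant of order n,

\[
\begin{vmatrix}
a_1+\alpha_1 & b_1+\beta_1 & \cdots & k_1+\kappa_1 \\
a_2+\alpha_2 & b_2+\beta_2 & \cdots & k_2+\kappa_2 \\
\vdots & \vdots &  & \vdots \\
a_n+\alpha_n & b_n+\beta_n & \cdots & k_n+\kappa_n
\end{vmatrix},
\]

is equal to the sum of the $2^n$ determinants corresponding to the
$2^n$ different ways of choosing one letter from each column; in
particular,

\[
\begin{vmatrix}
a_1+\alpha_1 & b_1+\beta_1 \\
a_2+\alpha_2 & b_2+\beta_2
\end{vmatrix}
\]

is the sum of the four determinants

\[
\begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix}, \quad
\begin{vmatrix} a_1 & \beta_1 \\ a_2 & \beta_2 \end{vmatrix}, \quad
\begin{vmatrix} \alpha_1 & b_1 \\ \alpha_2 & b_2 \end{vmatrix}, \quad
\begin{vmatrix} \alpha_1 & \beta_1 \\ \alpha_2 & \beta_2 \end{vmatrix}.
\]

This again is obvious from the definition; for

\[ \sum \pm (a_r+\alpha_r)(b_s+\beta_s) \ldots (k_\theta+\kappa_\theta) \]

is the algebraic sum of the $2^n$ summations typified by taking
one term from each bracket.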
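3.4. Theorem 7. The value of a determinant is unaltered if
to each element of one column (or row) is added a constant
multiple of the corresponding element of another column (or row);
in particular,

\[
\begin{vmatrix}
a_1 & b_1 & c_1 \\
a_2 & b_2 & c_2 \\
a_3 & b_3 & c_3
\end{vmatrix}
=
\begin{vmatrix}
a_1+\lambda b_1 & b_1 & c_1 \\
a_2+\lambda b_2 & b_2 & c_2 \\
a_3+\lambda b_3 & b_3 & c_3
\end{vmatrix}.
\]

Proof. Let the determinant be $\Delta_n$ of § 2.1, x and y the letters
of two distinct columns of $\Delta_n$. Let $\Delta'_n$ be the determinant
formed from $\Delta_n$ by replacing each $x_r$ of the x column by
$x_r + \lambda y_r$, where λ is independent of r. Then, by Theorem 1,

\[
\Delta'_n = \sum (-1)^N a_r b_s \ldots (x_u + \lambda y_u) \ldots y_t \ldots k_\theta
\]

\[
=
\begin{vmatrix}
a_1 & b_1 & \cdots & x_1 & \cdots & y_1 & \cdots & k_1 \\
a_2 & b_2 & \cdots & x_2 & \cdots & y_2 & \cdots & k_2 \\
\vdots &  &  & \vdots &  & \vdots &  & \vdots \\
a_n & b_n & \cdots & x_n & \cdots & y_n & \cdots & k_n
\end{vmatrix}
+
\begin{vmatrix}
a_1 & b_1 & \cdots & \lambda y_1 & \cdots & y_1 & \cdots & k_1 \\
a_2 & b_2 & \cdots & \lambda y_2 & \cdots & y_2 & \cdots & k_2 \\
\vdots &  &  & \vdots &  & \vdots &  & \vdots \\
a_n & b_n & \cdots & \lambda y_n & \cdots & y_n & \cdots & k_n
\end{vmatrix}.
\]

But the second of these is zero since, by Theorem 5, it is λ times
a determinant which has two columns identical. Hence $\Delta'_n = \Delta_n$.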

Corollaries of Theorem 7. There are many extensions of
Theorem 7. For example, by repeated applications of Theorem 7
it can be proved that

We may add to each column (or row) of a determinant fixed
multiples of the subsequent columns (or rows) and leave the value
of the determinant unaltered; in particular,

\[
\begin{vmatrix}
a_1 & b_1 & c_1 \\
a_2 & b_2 & c_2 \\
a_3 & b_3 & c_3
\end{vmatrix}
=
\begin{vmatrix}
a_1+\lambda b_1+\mu c_1 & b_1+\nu c_1 & c_1 \\
a_2+\lambda b_2+\mu c_2 & b_2+\nu c_2 & c_2 \\
a_3+\lambda b_3+\mu c_3 & b_3+\nu c_3 & c_3
\end{vmatrix}.
\]

There is a similar corollary with preceding instead of subsequent.

Another extension of Theorem 7 is

We may add multiples of any one column (or row) to every other
column (or row) and leave the value of the determinant unaltered; in
particular,

\[
\begin{vmatrix}
a_1 & b_1 & c_1 \\
a_2 & b_2 & c_2 \\
a_3 & b_3 & c_3
\end{vmatrix}
=
\begin{vmatrix}
a_1+\lambda b_1 & b_1 & c_1+\mu b_1 \\
a_2+\lambda b_2 & b_2 & c_2+\mu b_2 \\
a_3+\lambda b_3 & b_3 & c_3+\mu b_3
\end{vmatrix}.
\]
There are many others. But experience rather than set rule
is the better guide for further extensions. The practice of
adding multiples of columns or rows at random is liable to lead
to error unless each step is checked for validity by appeal to
the theorems. For example,

\[
\Delta =
\begin{vmatrix}
a_1+\lambda b_1 & b_1+\mu c_1 & c_1+\nu a_1 \\
a_2+\lambda b_2 & b_2+\mu c_2 & c_2+\nu a_2 \\
a_3+\lambda b_3 & b_3+\mu c_3 & c_3+\nu a_3
\end{vmatrix}
\]

is, by Theorem 6,

\[
\begin{vmatrix}
a_1 & b_1 & c_1 \\
a_2 & b_2 & c_2 \\
a_3 & b_3 & c_3
\end{vmatrix}
+
\begin{vmatrix}
\lambda b_1 & \mu c_1 & \nu a_1 \\
\lambda b_2 & \mu c_2 & \nu a_2 \\
\lambda b_3 & \mu c_3 & \nu a_3
\end{vmatrix},
\]

all the other determinants envisaged by Theorem 6, such as

\[
\begin{vmatrix}
\lambda b_1 & b_1 & c_1 \\
\lambda b_2 & b_2 & c_2 \\
\lambda b_3 & b_3 & c_3
\end{vmatrix},
\]

being zero in virtue of Theorems 4 and 5. Hence the determinant
Δ is equal to

\[
(1+\lambda\mu\nu)
\begin{vmatrix}
a_1 & b_1 & c_1 \\
a_2 & b_2 & c_2 \\
a_3 & b_3 & c_3
\end{vmatrix}.
\]

Note. One of the more curious errors into which one is led by adding
multiples of rows at random is a fallacious proof that Δ = 0. In the
example just given, ‘subtract second column from first, add second and
third, add first and third’, corresponds to λ = −1, μ = 1, ν = 1, a choice
of values that will (wrongly, of course) ‘prove’ that Δ = 0: the manipulation
of the columns has not left the value unaltered, but multiplied
the determinant by a zero factor, 1+λμν.
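The warning in the Note can be reproduced numerically. In the sketch below (the helper names and the sample array are assumptions of the example), the three column changes are first made simultaneously from the original columns, which multiplies the determinant by 1+λμν, and then one at a time, each step being a genuine application of Theorem 7.

```python
from itertools import permutations

def det(rows):
    """Signed sum of Theorem 1 (i); suitable only for small orders."""
    n = len(rows)
    total = 0
    for perm in permutations(range(n)):
        inv = sum(1 for i in range(n) for j in range(i + 1, n) if perm[i] > perm[j])
        term = 1
        for col, row in enumerate(perm):
            term *= rows[row][col]
        total += (-1) ** inv * term
    return total

def simultaneous(rows, lam, mu, nu):
    """Columns a + lam*b, b + mu*c, c + nu*a, all formed from the ORIGINAL columns."""
    return [[a + lam * b, b + mu * c, c + nu * a] for a, b, c in rows]

A = [[2, 1, 3],
     [0, -1, 4],
     [5, 2, 1]]
d = det(A)

general = det(simultaneous(A, 2, 3, 5))    # factor 1 + 2*3*5 = 31
fallacy = det(simultaneous(A, -1, 1, 1))   # factor 1 + (-1)(1)(1) = 0

# The same three operations done one column at a time leave the value unaltered.
step1 = [[a - b, b, c] for a, b, c in A]
step2 = [[a, b + c, c] for a, b, c in step1]
step3 = [[a, b, c + a] for a, b, c in step2]
sequential = det(step3)
```

The difference between `fallacy` and `sequential` is exactly the point of the Note: in the sequential version the third step adds the already altered first column, so the cyclic dependence never arises.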


3.5. Applications of Theorems 1-7. As a convenient
notation for the application of Theorem 7 and its corollaries,
we shall use

\[ r'_k = l r_1 + m r_2 + \cdots + t r_n \]

to denote that, starting from a determinant $\Delta_n$, we form a new
determinant $\Delta'_n$ whose kth row is obtained by taking l times the
first row plus m times the second row plus ... plus t times the
nth row of $\Delta_n$. The notation

\[ c'_k = \lambda c_1 + \mu c_2 + \cdots + \tau c_n \]

refers to a similar process applied to columns.


Examples I 

1. Find the value of

\[
\Delta =
\begin{vmatrix}
87 & 42 & 3 & -1 \\
45 & 18 & 7 & 4 \\
50 & 17 & 3 & -5 \\
91 & 9 & 6 & 0
\end{vmatrix}.
\]

The working that follows is an illustration of how one may deal with
an isolated numerical determinant. For a systematic method of computing
determinants (one which reduces the order of the determinant
from, say, 6 to 5, from 5 to 4, from 4 to 3, from 3 to 2) the reader is
referred to Whittaker and Robinson, The Calculus of Observations
(London, 1926), chapter v.







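On writing $c'_1 = c_1 - 2c_2 - c_3$, $c'_3 = c_3 + 3c_4$,

\[
\Delta =
\begin{vmatrix}
0 & 42 & 0 & -1 \\
2 & 18 & 19 & 4 \\
13 & 17 & -12 & -5 \\
67 & 9 & 6 & 0
\end{vmatrix}
= -42
\begin{vmatrix}
2 & 19 & 4 \\
13 & -12 & -5 \\
67 & 6 & 0
\end{vmatrix}
+
\begin{vmatrix}
2 & 18 & 19 \\
13 & 17 & -12 \\
67 & 9 & 6
\end{vmatrix}
\]

by Theorem 1. On expanding the third-order determinants,

\[
\Delta = -42\{2(30) - 13(-24) + 67(-95+48)\} + \{2(102+108) - 13(108-171) + 67(-216-323)\}, \text{ etc.}
\]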
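The working of Example 1 can be checked by machine. The sketch below is an illustration under stated assumptions: the first entry of the third row is read as 50, which is the value required for the reduced first column to contain 13, and the helper `det` is introduced here for the purpose.

```python
from itertools import permutations

def det(rows):
    """Signed sum of Theorem 1 (i); suitable only for small orders."""
    n = len(rows)
    total = 0
    for perm in permutations(range(n)):
        inv = sum(1 for i in range(n) for j in range(i + 1, n) if perm[i] > perm[j])
        term = 1
        for col, row in enumerate(perm):
            term *= rows[row][col]
        total += (-1) ** inv * term
    return total

A = [[87, 42, 3, -1],
     [45, 18, 7, 4],
     [50, 17, 3, -5],   # first entry read as 50 (assumption stated above)
     [91, 9, 6, 0]]

# c1' = c1 - 2*c2 - c3 and c3' = c3 + 3*c4; columns c2 and c4 are left alone.
B = [[r[0] - 2 * r[1] - r[2], r[1], r[2] + 3 * r[3], r[3]] for r in A]

# The final expression quoted in the working, evaluated term by term.
from_text = (-42 * (2 * 30 - 13 * (-24) + 67 * (-95 + 48))
             + (2 * (102 + 108) - 13 * (108 - 171) + 67 * (-216 - 323)))
```

Both column operations are valid uses of Theorem 7 here because the columns being added (the second, third and fourth) are taken before any of them is altered in a way that feeds back into the others.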

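2. Prove that†

\[
\begin{vmatrix}
1 & 1 & 1 \\
\alpha & \beta & \gamma \\
\beta\gamma & \gamma\alpha & \alpha\beta
\end{vmatrix}
= (\alpha-\beta)(\beta-\gamma)(\gamma-\alpha).
\]

On taking $c'_2 = c_2 - c_1$, $c'_3 = c_3 - c_2$, the determinant becomes

\[
\begin{vmatrix}
1 & 0 & 0 \\
\alpha & \beta-\alpha & \gamma-\beta \\
\beta\gamma & \gamma(\alpha-\beta) & \alpha(\beta-\gamma)
\end{vmatrix}
\]

which, by Theorems 1 and 5, is equal to

\[
(\alpha-\beta)(\beta-\gamma)
\begin{vmatrix}
-1 & -1 \\
\gamma & \alpha
\end{vmatrix},
\]

i.e. $(\alpha-\beta)(\beta-\gamma)(\gamma-\alpha)$.

† This determinant, and others that occur in this set of examples, can be
evaluated quickly by using the Remainder Theorem. Here they are intended
as exercises on §§ 1-3.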

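3. Prove that

\[
\Delta =
\begin{vmatrix}
0 & (\alpha-\beta)^2 & (\alpha-\gamma)^2 & (\alpha-\delta)^2 \\
(\beta-\alpha)^2 & 0 & (\beta-\gamma)^2 & (\beta-\delta)^2 \\
(\gamma-\alpha)^2 & (\gamma-\beta)^2 & 0 & (\gamma-\delta)^2 \\
(\delta-\alpha)^2 & (\delta-\beta)^2 & (\delta-\gamma)^2 & 0
\end{vmatrix}
= 0.
\]

When we have studied the multiplication of determinants we shall
see (Examples IV, 2) that the given determinant is ‘obviously’ zero,
being the product of two determinants each having a column of zeros.
But, at present, we treat it as an exercise on Theorem 7. Take

$r'_1 = r_1 - r_2$ and remove the factor α−β from $r'_1$ (Theorem 5),
$r'_2 = r_2 - r_3$ and remove the factor β−γ from $r'_2$,
$r'_3 = r_3 - r_4$ and remove the factor γ−δ from $r'_3$.

Then Δ/(α−β)(β−γ)(γ−δ) is equal to

\[
\begin{vmatrix}
-\alpha+\beta & \alpha-\beta & \alpha+\beta-2\gamma & \alpha+\beta-2\delta \\
\beta+\gamma-2\alpha & -\beta+\gamma & \beta-\gamma & \beta+\gamma-2\delta \\
\gamma+\delta-2\alpha & \gamma+\delta-2\beta & -\gamma+\delta & \gamma-\delta \\
(\delta-\alpha)^2 & (\delta-\beta)^2 & (\delta-\gamma)^2 & 0
\end{vmatrix}.
\]

In this take $c'_1 = c_1 - c_2$, $c'_2 = c_2 - c_3$, $c'_3 = c_3 - c_4$ and remove the factors
β−α, γ−β, δ−γ; it becomes

\[
(\beta-\alpha)(\gamma-\beta)(\delta-\gamma)
\begin{vmatrix}
2 & 2 & 2 & \alpha+\beta-2\delta \\
2 & 2 & 2 & \beta+\gamma-2\delta \\
2 & 2 & 2 & \gamma-\delta \\
2\delta-\alpha-\beta & 2\delta-\beta-\gamma & \delta-\gamma & 0
\end{vmatrix}.
\]

In the last determinant take $r'_1 = r_1 - r_2$, $r'_2 = r_2 - r_3$; it becomes

\[
\begin{vmatrix}
0 & 0 & 0 & \alpha-\gamma \\
0 & 0 & 0 & \beta-\delta \\
2 & 2 & 2 & \gamma-\delta \\
2\delta-\alpha-\beta & 2\delta-\beta-\gamma & \delta-\gamma & 0
\end{vmatrix},
\]

which, on expanding by the first row, is zero since the determinant of
order 3 that multiplies α−γ is one with a complete row of zeros.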

4. Prove that

\[
\begin{vmatrix}
0 & (\alpha-\beta)^3 & (\alpha-\gamma)^3 \\
(\beta-\alpha)^3 & 0 & (\beta-\gamma)^3 \\
(\gamma-\alpha)^3 & (\gamma-\beta)^3 & 0
\end{vmatrix}
= 0.
\]
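5. Prove that

\[
\begin{vmatrix}
\alpha & \beta & \gamma \\
\alpha^2 & \beta^2 & \gamma^2 \\
\beta+\gamma & \gamma+\alpha & \alpha+\beta
\end{vmatrix}
= (\alpha+\beta+\gamma)(\beta-\gamma)(\gamma-\alpha)(\alpha-\beta).
\]

Hint. Take $r'_3 = r_1 + r_3$.

6. Prove that

\[
\begin{vmatrix}
1 & \alpha & \beta+\gamma & \delta \\
1 & \beta & \gamma+\delta & \alpha \\
1 & \gamma & \delta+\alpha & \beta \\
1 & \delta & \alpha+\beta & \gamma
\end{vmatrix}
= 0.
\]

7. Prove that

\[
\begin{vmatrix}
0 & \alpha^2-\beta^2 & \alpha^2-\gamma^2 & \alpha^2-\delta^2 \\
\beta^2-\alpha^2 & 0 & \beta^2-\gamma^2 & \beta^2-\delta^2 \\
\gamma^2-\alpha^2 & \gamma^2-\beta^2 & 0 & \gamma^2-\delta^2 \\
\delta^2-\alpha^2 & \delta^2-\beta^2 & \delta^2-\gamma^2 & 0
\end{vmatrix}
= 0.
\]

Hint. Use Theorem 6.

8. Prove that

\[
\begin{vmatrix}
2\alpha & \alpha+\beta & \alpha+\gamma & \alpha+\delta \\
\beta+\alpha & 2\beta & \beta+\gamma & \beta+\delta \\
\gamma+\alpha & \gamma+\beta & 2\gamma & \gamma+\delta \\
\delta+\alpha & \delta+\beta & \delta+\gamma & 2\delta
\end{vmatrix}
= 0.
\]

9. Prove that

\[
\begin{vmatrix}
1 & 1 & 1 & 1 \\
\alpha & \beta & \gamma & \delta \\
\beta\gamma\delta & \gamma\delta\alpha & \delta\alpha\beta & \alpha\beta\gamma \\
\alpha^2 & \beta^2 & \gamma^2 & \delta^2
\end{vmatrix}
= (\alpha-\beta)(\alpha-\gamma)(\alpha-\delta)(\beta-\gamma)(\beta-\delta)(\gamma-\delta).
\]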


10. Prove that

\[
\Delta =
\begin{vmatrix}
0 & a & b & c \\
-a & 0 & d & e \\
-b & -d & 0 & f \\
-c & -e & -f & 0
\end{vmatrix}
= (af - be + cd)^2.
\]

The determinant may be evaluated directly if we consider it as

\[
\begin{vmatrix}
a_1 & b_1 & c_1 & d_1 \\
a_2 & b_2 & c_2 & d_2 \\
a_3 & b_3 & c_3 & d_3 \\
a_4 & b_4 & c_4 & d_4
\end{vmatrix}
\]

whose expansion is $\sum (-1)^N a_r b_s c_t d_u$, the value of N being determined
by the rule of § 2.2. There are, however, a number of zero terms in the
expansion of Δ. The non-zero terms are

$a^2 f^2$ [$a_2 b_1 c_4 d_3$ and so is prefixed by $(-1)^2$],

$b^2 e^2$, $c^2 d^2$, each prefixed by +,

two terms aebf [one $a_2 b_4 c_1 d_3$, the other $a_3 b_1 c_4 d_2$, and so each prefixed
by $(-1)^3$],

two terms adfc prefixed by +, and two terms cdbe each prefixed by −.
Hence $\Delta = a^2 f^2 + b^2 e^2 + c^2 d^2 - 2aebf + 2afcd - 2becd$.

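11. Prove that the expansion of

\[
\begin{vmatrix}
0 & c & b & d \\
c & 0 & a & e \\
b & a & 0 & f \\
d & e & f & 0
\end{vmatrix}
\]

is $a^2 d^2 + b^2 e^2 + c^2 f^2 - 2bcef - 2cafd - 2abde$.

12. Prove that the expansion of

\[
\begin{vmatrix}
0 & 1 & 1 & 1 \\
1 & 0 & z^2 & y^2 \\
1 & z^2 & 0 & x^2 \\
1 & y^2 & x^2 & 0
\end{vmatrix}
\]

is $\sum x^4 - 2\sum y^2 z^2$.

13. (Harder.) Express the determinant

\[
\begin{vmatrix}
x+a & b & c & d \\
b & x+c & d & a \\
c & d & x+a & b \\
d & a & b & x+c
\end{vmatrix}
\]

as a product of factors.

Hint. Two linear factors x+a+c±(b+d) and one quadratic factor.

14. From a determinant Δ, of order 4, a new determinant Δ' is
formed by taking

\[ c'_1 = c_1 + \lambda c_2, \quad c'_2 = c_2 + \mu c_3, \quad c'_3 = c_3 + \nu c_1, \quad c'_4 = c_4. \]

Prove that Δ' = (1+λμν)Δ.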

4. Notation for determinants 

The determinant $\Delta_n$ of § 2.1 is sufficiently indicated by its
leading diagonal and it is often written as $(a_1 b_2 c_3 \ldots k_n)$.

The use of the double suffix notation, which we used on p. 3
of the Preliminary Note, enables one to abbreviate still further.
The determinant that has $a_{rs}$ as the element in the rth row
and the sth column may be written as $|a_{rs}|$.

Sometimes a determinant is sufficiently indicated by its first
row; thus $(a_1 b_2 c_3 \ldots k_n)$ may be indicated by $|a_1 b_1 c_1 \ldots k_1|$, but
the notation is liable to misinterpretation.

5. Standard types of determinant

5.1. The product of differences: alternants.

Theorem 8. The determinant

\[
\Delta_4 =
\begin{vmatrix}
\alpha^3 & \beta^3 & \gamma^3 & \delta^3 \\
\alpha^2 & \beta^2 & \gamma^2 & \delta^2 \\
\alpha & \beta & \gamma & \delta \\
1 & 1 & 1 & 1
\end{vmatrix}
\]

is equal to (α−β)(α−γ)(α−δ)(β−γ)(β−δ)(γ−δ); the corresponding
determinant of order n, namely $(\alpha^{n-1} \beta^{n-2} \ldots 1)$, is equal to the
product of the differences that can be formed from the letters
α, β, ..., κ appearing in the determinant, due regard being paid
to alphabetical order in the factors. Such determinants are called
ALTERNANTS.

On expanding $\Delta_4$, we see that it may be regarded

(a) as a homogeneous polynomial of degree 6 in the variables $\alpha, \beta, \gamma, \delta$, the coefficients being $\pm 1$;

(b) as a non-homogeneous polynomial of highest degree 3 in $\alpha$, the coefficients of the powers of $\alpha$ being functions of $\beta, \gamma, \delta$.

The determinant vanishes when $\alpha = \beta$, since it then has two columns identical. Hence, by the Remainder Theorem applied to a polynomial in $\alpha$, the determinant has a factor $\alpha-\beta$. By a similar argument, the difference of any two of $\alpha, \beta, \gamma, \delta$ is a factor of $\Delta_4$, so that

\[ \Delta_4 = K(\alpha-\beta)(\alpha-\gamma)(\alpha-\delta)(\beta-\gamma)(\beta-\delta)(\gamma-\delta). \tag{10} \]

Since $\Delta_4$ is a homogeneous polynomial of degree 6 in $\alpha, \beta, \gamma, \delta$, the factor $K$ must be independent of $\alpha, \beta, \gamma, \delta$ and so is a numerical constant. The coefficient of $\alpha^3\beta^2\gamma$ on the right-hand side of (10) is $K$, and in $\Delta_4$ it is 1. Hence $K = 1$ and

\[ \Delta_4 = (\alpha-\beta)(\alpha-\gamma)(\alpha-\delta)(\beta-\gamma)(\beta-\delta)(\gamma-\delta). \]

The last product is conveniently written as $\zeta(\alpha, \beta, \gamma, \delta)$, and the corresponding product of differences of the $n$ letters $\alpha, \beta, \dots, \kappa$ as $\zeta(\alpha, \beta, \dots, \kappa)$. This last product contains

\[ (n-1) + (n-2) + \dots + 1 \]

factors and the degree of the determinant $(\alpha^{n-1}\beta^{n-2} \dots 1)$ is also $(n-1) + (n-2) + \dots + 1$, so that the argument used for $\Delta_4$ is readily extended to $\Delta_n$.
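Theorem 8 can be checked numerically. The following is a minimal Python sketch, not part of the original text; the helper names `det`, `alternant` and `difference_product` are illustrative assumptions. It expands the alternant directly from the definition (one signed term for each permutation of the row suffixes) and compares the result with the product of differences.

```python
from itertools import combinations, permutations
from math import prod


def det(m):
    """Expand by the definition: one signed term per permutation of the row suffixes."""
    n = len(m)
    total = 0
    for rows in permutations(range(n)):
        # N = number of inversions among the row suffixes, read in column order
        n_inv = sum(1 for i, j in combinations(range(n), 2) if rows[i] > rows[j])
        total += (-1) ** n_inv * prod(m[rows[col]][col] for col in range(n))
    return total


def alternant(letters):
    """Rows hold the powers n-1, n-2, ..., 0 of the letters, as in Theorem 8."""
    n = len(letters)
    return [[x ** (n - 1 - i) for x in letters] for i in range(n)]


def difference_product(letters):
    """Product of (earlier letter - later letter) over all pairs of letters."""
    return prod(x - y for x, y in combinations(letters, 2))


letters = [7, 3, 2, -1]  # arbitrary numerical values for alpha, beta, gamma, delta
print(det(alternant(letters)), difference_product(letters))
```

The permutation expansion costs $n!$ terms, so it is suitable only for small orders; it is used here because it mirrors the definition of the determinant rather than for efficiency.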

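5.2. The circulant.

Theorem 9. The determinant

\[ \Delta = \begin{vmatrix} a_1 & a_2 & a_3 & \dots & a_n \\ a_n & a_1 & a_2 & \dots & a_{n-1} \\ a_{n-1} & a_n & a_1 & \dots & a_{n-2} \\ \vdots & & & & \vdots \\ a_2 & a_3 & a_4 & \dots & a_1 \end{vmatrix} = \prod (a_1 + a_2\omega + a_3\omega^2 + \dots + a_n\omega^{n-1}), \]

where the product is taken over the $n$-th roots of unity. Such a determinant is called a circulant.

Let $\omega$ be any one of the $n$ numbers

\[ \omega_k = \cos\frac{2k\pi}{n} + i\sin\frac{2k\pi}{n} \quad (k = 1, 2, \dots, n). \]

In $\Delta$ replace the first column $c_1$ by a new column $c_1'$, where

\[ c_1' = c_1 + \omega c_2 + \omega^2 c_3 + \dots + \omega^{n-1} c_n. \]

This leaves the value of $\Delta$ unaltered.

The first column of the new determinant is, on using the fact that $\omega^n = 1$ and writing $a_1 + a_2\omega + \dots + a_n\omega^{n-1} = \alpha$,

\[ \begin{aligned}
a_1 + a_2\omega + \dots + a_n\omega^{n-1} &= \alpha, \\
a_n\omega^n + a_1\omega + a_2\omega^2 + \dots + a_{n-1}\omega^{n-1} &= \omega\alpha, \\
a_{n-1}\omega^n + a_n\omega^{n+1} + a_1\omega^2 + \dots + a_{n-2}\omega^{n-1} &= \omega^2\alpha, \\
&\ \,\vdots \\
a_2\omega^n + a_3\omega^{n+1} + \dots + a_n\omega^{2n-2} + a_1\omega^{n-1} &= \omega^{n-1}\alpha.
\end{aligned} \]

Hence the first column of the new determinant has a factor $\alpha = a_1 + a_2\omega + \dots + a_n\omega^{n-1}$, which is therefore (Theorem 6) a factor of $\Delta$. This is true for $\omega = \omega_k$ ($k = 1, 2, \dots, n$), so that

\[ \Delta = K \prod_{k=1}^{n} (a_1 + a_2\omega_k + \dots + a_n\omega_k^{n-1}). \tag{11} \]

Moreover, since $\Delta$ and $\prod$ are homogeneous of degree $n$ in the variables $a_1, a_2, \dots, a_n$, the factor $K$ must be independent of these variables and so is a numerical constant. Comparing the coefficients of $a_1^n$ on the two sides of (11), we see that $K = 1$.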
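The factorization of Theorem 9 can be tested on a numerical example. This is a minimal Python sketch under stated assumptions: the function names are illustrative, the determinant is expanded by its first row, and the roots of unity are taken as $\omega_k = e^{2\pi i k/n}$. Because the product is computed in floating-point complex arithmetic, the comparison is made to within a small tolerance.

```python
import cmath


def det(m):
    """Determinant by expansion along the first row (adequate for small orders)."""
    if not m:
        return 1
    return sum(
        (-1) ** c * m[0][c] * det([row[:c] + row[c + 1:] for row in m[1:]])
        for c in range(len(m))
    )


def circulant(first_row):
    """Row i is the first row moved i places to the right, cyclically."""
    n = len(first_row)
    return [[first_row[(j - i) % n] for j in range(n)] for i in range(n)]


def root_product(first_row):
    """Product of a_1 + a_2*w + ... + a_n*w**(n-1) over the n-th roots of unity w."""
    n = len(first_row)
    result = 1
    for k in range(1, n + 1):
        omega = cmath.exp(2j * cmath.pi * k / n)
        result *= sum(a * omega ** j for j, a in enumerate(first_row))
    return result


row = [2, -1, 3, 5]
lhs = det(circulant(row))
rhs = root_product(row)
print(lhs, rhs)  # the imaginary part of rhs is rounding noise only
assert abs(lhs - rhs) < 1e-9
```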

6. Odd and even permutations

6.1. The $n$ numbers $r, s, \dots, \theta$ are said to be a permutation of $1, 2, \dots, n$ if they consist of $1, 2, \dots, n$ in some order. We can obtain the order $r, s, \dots, \theta$ from the order $1, 2, \dots, n$ by suitable interchanges of pairs; but the set of interchanges leading from the one order to the other is not unique. For example, 4 3 2 6 1 5 becomes 1 2 3 4 5 6 after the interchanges denoted by

\[ \begin{pmatrix} 4&3&2&6&1&5 \\ 1&3&2&6&4&5 \end{pmatrix}, \quad \begin{pmatrix} 1&3&2&6&4&5 \\ 1&2&3&6&4&5 \end{pmatrix}, \quad \begin{pmatrix} 1&2&3&6&4&5 \\ 1&2&3&4&6&5 \end{pmatrix}, \quad \begin{pmatrix} 1&2&3&4&6&5 \\ 1&2&3&4&5&6 \end{pmatrix}, \]

whereby we first put 1 in the first place, then 2 in the second place, and so on. But we may arrive at the same final result by first putting 6 in the sixth place, then 5 in the fifth, and so on: or we can proceed solely by adjacent interchanges, beginning by

\[ \begin{pmatrix} 4&3&2&6&1&5 \\ 4&3&2&1&6&5 \end{pmatrix}, \quad \begin{pmatrix} 4&3&2&1&6&5 \\ 4&3&1&2&6&5 \end{pmatrix}, \]

as first steps towards moving 1 into the first place. In fact, as the reader will see for himself, there is a wide variety of sets of interchanges that will ultimately change 4 3 2 6 1 5 into 1 2 3 4 5 6.

Now suppose that there are $K$ interchanges in any one way of going from $r, s, \dots, \theta$ to $1, 2, \dots, n$. Then, by condition III in the definition of a determinant (vide §2.1), the term $a_r b_s \dots k_\theta$ in the expansion of $\Delta_n = (a_1 b_2 \dots k_n)$ is prefixed by the sign $(-1)^K$. But, as we have proved, the definition of $\Delta_n$ by the conditions I, II, III is unique. Hence the sign to be prefixed to any given term is uniquely determined, and therefore the sign $(-1)^K$ must be the same whatever set of interchanges is used in going from $r, s, \dots, \theta$ to $1, 2, \dots, n$. Thus, if one way of going from $r, s, \dots, \theta$ to $1, 2, \dots, n$ involves an odd (even) number of interchanges, so does every way of going from $r, s, \dots, \theta$ to $1, 2, \dots, n$.

Accordingly, every permutation may be characterized either as even, when the change from it to the standard order $1, 2, \dots, n$ can be effected by an even number of interchanges, or as odd, when the change from it to the standard order $1, 2, \dots, n$ can be effected by an odd number of interchanges.
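The three ways of reaching the standard order described in §6.1 can be carried out mechanically. The following Python sketch is an illustration only (the function names are assumptions, not the author's); it counts the interchanges used by each method for the permutation 4 3 2 6 1 5 and confirms that the counts, though different, have the same parity.

```python
def front_first(perm):
    """Put 1 in the first place, then 2 in the second place, and so on."""
    p, swaps = list(perm), 0
    for i in range(len(p)):
        j = p.index(i + 1)
        if j != i:
            p[i], p[j] = p[j], p[i]
            swaps += 1
    return swaps


def back_first(perm):
    """Put n in the last place, then n-1 in the place before it, and so on."""
    p, swaps = list(perm), 0
    for i in reversed(range(len(p))):
        j = p.index(i + 1)
        if j != i:
            p[i], p[j] = p[j], p[i]
            swaps += 1
    return swaps


def adjacent_only(perm):
    """Use adjacent interchanges only (a bubble sort), counting each interchange."""
    p, swaps = list(perm), 0
    for end in range(len(p) - 1, 0, -1):
        for i in range(end):
            if p[i] > p[i + 1]:
                p[i], p[i + 1] = p[i + 1], p[i]
                swaps += 1
    return swaps


perm = [4, 3, 2, 6, 1, 5]
counts = [front_first(perm), back_first(perm), adjacent_only(perm)]
print(counts)
assert len({k % 2 for k in counts}) == 1  # every count has the same parity
```

The number of adjacent interchanges used by the third method equals the number of inversions in the permutation, which is the quantity $N$ used elsewhere in the text to fix the sign of a term.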

6.2. It can be proved, without reference to determinants, that, if one way of changing from $r, s, \dots, \theta$ to $1, 2, \dots, n$ involves an odd (even) number of interchanges, so does every way of effecting the same result. When this has been done it is legitimate to define $\Delta_n = (a_1 b_2 \dots k_n)$ as $\sum \pm a_r b_s \dots k_\theta$, where the plus or minus sign is prefixed to a term according as its suffixes form an even or an odd permutation of $1, 2, \dots, n$. This is, in fact, one of the common ways of defining a determinant.

7. Differentiation 

When the elements of a determinant are functions of a variable $x$, the rule for obtaining its differential coefficient is as follows. If $\Delta$ denotes a determinant of order $n$ whose elements are functions of $x$, $d\Delta/dx$ is the sum of the $n$ determinants obtained by differentiating the elements of one row (or column) of $\Delta$ and leaving the elements of the other $n-1$ rows (or columns) unaltered. For example,

\[ \frac{d}{dx}\begin{vmatrix} x^2 & x & 1 \\ x^3 & x^2 & x \\ x^4 & x^3 & x^2 \end{vmatrix}
= \begin{vmatrix} 2x & 1 & 0 \\ x^3 & x^2 & x \\ x^4 & x^3 & x^2 \end{vmatrix}
+ \begin{vmatrix} x^2 & x & 1 \\ 3x^2 & 2x & 1 \\ x^4 & x^3 & x^2 \end{vmatrix}
+ \begin{vmatrix} x^2 & x & 1 \\ x^3 & x^2 & x \\ 4x^3 & 3x^2 & 2x \end{vmatrix}. \]




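The proof of the rule follows at once from Theorem 1; since

\[ \Delta = \sum (-1)^N a_r b_s \dots k_\theta, \]

the usual rules for differentiating a product give

\[ \frac{d\Delta}{dx} = \sum (-1)^N \left\{ \frac{da_r}{dx} b_s \dots k_\theta + a_r \frac{db_s}{dx} \dots k_\theta + \dots + a_r b_s \dots \frac{dk_\theta}{dx} \right\}. \]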

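Since, for example, $\sum (-1)^N \dfrac{da_r}{dx}\, b_s \dots k_\theta$ is the determinant obtained from $\Delta$ by writing $da_r/dx$ for each element $a_r$,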


$d\Delta/dx$ is the sum of $n$ determinants, in each of which one row consists of differential coefficients of a row of $\Delta$ and the remaining $n-1$ rows consist of the corresponding rows of $\Delta$.
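The rule can be compared with a direct numerical differentiation. The sketch below is illustrative only: the matrix of functions `ENTRIES`, the point `x0` and the step `h` are arbitrary assumptions chosen for the check, and the derivative of the determinant is approximated by a central difference.

```python
import math


def det3(m):
    """Determinant of a 3 x 3 array of numbers."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


# Each element is a pair (function, derivative); the choice is arbitrary.
ENTRIES = [
    [(lambda x: x ** 2, lambda x: 2 * x), (math.sin, math.cos), (lambda x: 1.0, lambda x: 0.0)],
    [(math.exp, math.exp), (lambda x: x, lambda x: 1.0), (math.cos, lambda x: -math.sin(x))],
    [(lambda x: x ** 3, lambda x: 3 * x ** 2), (lambda x: 2.0, lambda x: 0.0), (lambda x: x, lambda x: 1.0)],
]


def value(x):
    """The determinant of the functions, evaluated at x."""
    return det3([[f(x) for f, _ in row] for row in ENTRIES])


def derivative_by_rule(x):
    """Sum of the three determinants in which exactly one row is differentiated."""
    total = 0.0
    for k in range(3):
        rows = [
            [(df(x) if r == k else f(x)) for f, df in row]
            for r, row in enumerate(ENTRIES)
        ]
        total += det3(rows)
    return total


x0, h = 0.7, 1e-5
numeric = (value(x0 + h) - value(x0 - h)) / (2 * h)  # central difference
print(derivative_by_rule(x0), numeric)
assert abs(derivative_by_rule(x0) - numeric) < 1e-6
```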


Examples II 


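1. Prove that

\[ \begin{vmatrix} 1 & \beta+\gamma & \beta^2+\gamma^2 \\ 1 & \gamma+\alpha & \gamma^2+\alpha^2 \\ 1 & \alpha+\beta & \alpha^2+\beta^2 \end{vmatrix} = \begin{vmatrix} 1 & \alpha & \alpha^2 \\ 1 & \beta & \beta^2 \\ 1 & \gamma & \gamma^2 \end{vmatrix}. \]

Hint. Take $c_2' = c_2 - s_1 c_1$, $c_3' = c_3 - s_2 c_1$, where $s_r = \alpha^r + \beta^r + \gamma^r$.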

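2. Prove that

\[ \begin{vmatrix} 1 & \alpha & \beta\gamma \\ 1 & \beta & \gamma\alpha \\ 1 & \gamma & \alpha\beta \end{vmatrix} = -\zeta(\alpha, \beta, \gamma). \]

3. Prove that

\[ \begin{vmatrix} 1 & \beta+\gamma & (\beta+\gamma)^2 \\ 1 & \gamma+\alpha & (\gamma+\alpha)^2 \\ 1 & \alpha+\beta & (\alpha+\beta)^2 \end{vmatrix} = \zeta(\alpha, \beta, \gamma). \]

4. Prove that each of the determinants of the fourth order whose first rows are (i) $1,\ \beta+\gamma+\delta,\ \alpha^2,\ \alpha^3$; (ii) $1,\ \alpha,\ \beta^2+\gamma^2+\delta^2,\ \beta\gamma\delta$, is equal to $\pm\zeta(\alpha, \beta, \gamma, \delta)$. Write down other determinants that equal $\pm\zeta(\alpha, \beta, \gamma, \delta)$.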

5. Prove that the 'skew' circulant

\[ \begin{vmatrix} a_1 & a_2 & a_3 & \dots & a_n \\ -a_n & a_1 & a_2 & \dots & a_{n-1} \\ -a_{n-1} & -a_n & a_1 & \dots & a_{n-2} \\ \vdots & & & & \vdots \\ -a_2 & -a_3 & -a_4 & \dots & a_1 \end{vmatrix} = \prod (a_1 + a_2\omega + \dots + a_n\omega^{n-1}), \]

where $\omega$ runs through the $n$ roots of $-1$.

6. Prove that the circulant of order $2n$ whose first row is

\[ a_1 \quad a_2 \quad a_3 \quad \dots \quad a_{2n} \]

is the product of a circulant of order $n$ (with first row $a_1 + a_{n+1},\ a_2 + a_{n+2},\ \dots$) and a skew circulant of order $n$ (with first row $a_1 - a_{n+1},\ a_2 - a_{n+2},\ \dots$).

7. Prove that the circulant of the fourth order with a first row $a_1, a_2, a_3, a_4$ is equal to

\[ \{(a_1 + a_3)^2 - (a_2 + a_4)^2\}\{(a_1 - a_3)^2 + (a_2 - a_4)^2\}. \]

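8. When $Q(x) = (x-a_1)(x-a_2)\dots(x-a_n)$, prove that

\[ \begin{vmatrix} 1 & \dots & 1 \\ a_1 & \dots & a_n \\ \vdots & & \vdots \\ a_1^{n-2} & \dots & a_n^{n-2} \\ (x-a_1)^{-1} & \dots & (x-a_n)^{-1} \end{vmatrix} = \frac{1}{Q(x)} \begin{vmatrix} 1 & \dots & 1 \\ a_1 & \dots & a_n \\ \vdots & & \vdots \\ a_1^{n-2} & \dots & a_n^{n-2} \\ a_1^{n-1} & \dots & a_n^{n-1} \end{vmatrix}. \]

9. By putting $x = y^{-1}$ in Example 8 and expanding in powers of $y$, show that ${}_nH_p$, the sum of homogeneous products of $a_1, a_2, \dots, a_n$ of degree $p$, is equal to

\[ \begin{vmatrix} 1 & \dots & 1 \\ a_1 & \dots & a_n \\ \vdots & & \vdots \\ a_1^{n-2} & \dots & a_n^{n-2} \\ a_1^{n+p-1} & \dots & a_n^{n+p-1} \end{vmatrix} \div \begin{vmatrix} 1 & \dots & 1 \\ a_1 & \dots & a_n \\ \vdots & & \vdots \\ a_1^{n-2} & \dots & a_n^{n-2} \\ a_1^{n-1} & \dots & a_n^{n-1} \end{vmatrix}. \]
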
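10. Prove that

\[ \begin{vmatrix} 1 & 1 & 1 \\ \alpha & \beta & \gamma \\ \alpha^3 & \beta^3 & \gamma^3 \end{vmatrix} = (\beta-\gamma)(\gamma-\alpha)(\alpha-\beta)(\alpha+\beta+\gamma). \]

Hint. The first three factors are obtained as in Theorem 8. The degree of the determinant in $\alpha, \beta, \gamma$ is four; so the remaining factor must be linear in $\alpha, \beta, \gamma$ and it must be unaltered by the interchange of any two letters (for both the determinant and the product of the first three factors are altered in sign by such an interchange).

Alternatively, consider the coefficient of $\delta^2$ in the alternant $\zeta(\alpha, \beta, \gamma, \delta)$.

11. Prove that

\[ \begin{vmatrix} 1 & 1 & 1 \\ \alpha^2 & \beta^2 & \gamma^2 \\ \alpha^3 & \beta^3 & \gamma^3 \end{vmatrix} = (\beta-\gamma)(\gamma-\alpha)(\alpha-\beta)(\beta\gamma + \gamma\alpha + \alpha\beta). \]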

12. Extend the results of Examples 10 and 11 to determinants of 
higher order. 



CHAPTER II 

THE MINORS OF A DETERMINANT 

1. First minors

1.1. In the determinant of order $n$,

\[ \Delta_n = \begin{vmatrix} a_1 & b_1 & \dots & k_1 \\ \vdots & \vdots & & \vdots \\ a_n & b_n & \dots & k_n \end{vmatrix}, \]

the determinant of order $n-1$ obtained by deleting the row and the column containing $a_r$ is called the minor of $a_r$; and so for other letters. Such minors are called first minors; the determinant of order $n-2$ obtained by deleting the two rows with suffixes $r$, $s$ and the two columns with letters $a$, $b$ is called a second minor; and so on for third, fourth, ... minors. We shall denote the first minors by $\alpha_r, \beta_r, \dots$ .

We can expand $\Delta_n$ by any row or column (Chap. I, §3.11); for example, on expanding by the first column,

\[ \Delta_n = a_1\alpha_1 - a_2\alpha_2 + \dots + (-1)^{n-1} a_n\alpha_n, \tag{1} \]

or, on expanding by the second row,

\[ \Delta_n = -a_2\alpha_2 + b_2\beta_2 - \dots + (-1)^n k_2\kappa_2. \tag{2} \]

1.2. The above notation requires a careful consideration of sign in its use and it is more convenient to introduce co-factors. They are defined as the numbers

\[ A_r, B_r, \dots, K_r \quad (r = 1, 2, \dots, n) \]

such that

\[ \begin{aligned} \Delta_n &= a_1A_1 + a_2A_2 + \dots + a_nA_n \\ &= b_1B_1 + b_2B_2 + \dots + b_nB_n \\ &\ \,\vdots \\ &= k_1K_1 + k_2K_2 + \dots + k_nK_n, \end{aligned} \]

these being the expansions of $\Delta_n$ by its various columns, and

\[ \begin{aligned} \Delta_n &= a_1A_1 + b_1B_1 + \dots + k_1K_1 \\ &\ \,\vdots \\ &= a_nA_n + b_nB_n + \dots + k_nK_n, \end{aligned} \]

these being the expansions of $\Delta_n$ by its various rows.

It follows from the definition that $A_r, B_r, \dots$ are obtained by prefixing a suitable sign to $\alpha_r, \beta_r, \dots$ .

1.3. It is a simple matter to determine the sign for any particular minor. For example, to determine whether $C_2$ is $+\gamma_2$ or $-\gamma_2$ we observe that, on interchanging the first two rows of $\Delta_n$,

\[ \Delta_n = -\begin{vmatrix} a_2 & b_2 & \dots & k_2 \\ a_1 & b_1 & \dots & k_1 \\ a_3 & b_3 & \dots & k_3 \\ \vdots & \vdots & & \vdots \\ a_n & b_n & \dots & k_n \end{vmatrix} = -a_2\alpha_2 + b_2\beta_2 - c_2\gamma_2 + \dots, \]

while, by the definition of the co-factors,

\[ \Delta_n = a_2A_2 + b_2B_2 + c_2C_2 + \dots, \]

so that $C_2 = -\gamma_2$.

Or again, to determine whether $D_3$ is $+\delta_3$ or $-\delta_3$ we observe that $\Delta_n$ is unaltered (Theorem 3, Corollary) if we move the third row up until it becomes the first row, so that, on expanding by the first row of the determinant so formed,

\[ \Delta_n = a_3\alpha_3 - b_3\beta_3 + c_3\gamma_3 - d_3\delta_3 + \dots . \]

But

\[ \Delta_n = a_3A_3 + b_3B_3 + c_3C_3 + d_3D_3 + \dots \]

and so $D_3 = -\delta_3$.

1.4. If we use the double suffix notation (Chap. I, §4), the co-factor $A_{rs}$ of $a_{rs}$ in $|a_{rs}|$ is, by the procedure of §1.3, $(-1)^{r+s-2}$ times the minor of $a_{rs}$; that is, $A_{rs}$ is $(-1)^{r+s}$ times the determinant obtained by deleting the $r$th row and $s$th column.

2. We have seen in §1 that, with $\Delta_n = (a_1 b_2 \dots k_n)$,

\[ \Delta_n = a_rA_r + b_rB_r + \dots + k_rK_r. \tag{3} \]

If we replace the $r$th row of $\Delta_n$, namely

\[ a_r \quad b_r \quad \dots \quad k_r \]

by

\[ a_s \quad b_s \quad \dots \quad k_s, \]

where $s$ is one of the numbers $1, 2, \dots, n$ other than $r$, we thereby get a determinant having two rows $a_s\ b_s \dots k_s$; that is, we get a determinant equal to zero. But $A_r, B_r, \dots, K_r$ are unaffected by such a change in the $r$th row of $\Delta_n$. Hence, for the new determinant, (3) takes the form

\[ 0 = a_sA_r + b_sB_r + \dots + k_sK_r. \tag{4} \]

A corresponding result for columns may be proved in the same way, and the results summed up thus:

The sum of elements of a row (or column) multiplied by their own co-factors is $\Delta$; the sum of elements of a row multiplied by the corresponding co-factors of another row is zero; the sum of elements of a column multiplied by the corresponding co-factors of another column is zero.

We also state these important facts as a theorem.

Theorem 10. The determinant $\Delta = (a_1 b_2 \dots k_n)$ may be expanded by any row or by any column: such expansions take one of the forms

\[ \Delta = a_rA_r + b_rB_r + \dots + k_rK_r, \tag{5} \]

\[ \Delta = x_1X_1 + x_2X_2 + \dots + x_nX_n, \tag{6} \]

where $r$ is any one of the numbers $1, 2, \dots, n$ and $x$ is any one of the letters $a, b, \dots, k$. Moreover,

\[ 0 = a_sA_r + b_sB_r + \dots + k_sK_r, \tag{7} \]

\[ 0 = y_1X_1 + y_2X_2 + \dots + y_nX_n, \tag{8} \]

where $r$, $s$ are two different numbers taken from $1, 2, \dots, n$ and $x$, $y$ are two different letters taken from $a, b, \dots, k$.
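The four statements (5) to (8) can be verified on a numerical determinant. The following is a minimal Python sketch with illustrative names (`minor`, `cofactor`, `check_theorem_10` are assumptions, not notation from the text); rows and columns are numbered from 0 in the code, which leaves the sign $(-1)^{r+s}$ of §1.4 unchanged.

```python
def minor(m, r, c):
    """Delete row r and column c."""
    return [row[:c] + row[c + 1:] for i, row in enumerate(m) if i != r]


def cofactor(m, r, c):
    """(-1)**(r+c) times the determinant left after deleting row r and column c."""
    return (-1) ** (r + c) * det(minor(m, r, c))


def det(m):
    """Expansion by the first row in terms of co-factors."""
    if len(m) == 1:
        return m[0][0]
    return sum(m[0][c] * cofactor(m, 0, c) for c in range(len(m)))


def check_theorem_10(m):
    """Own co-factors give the determinant; co-factors of another line give zero."""
    n, delta = len(m), det(m)
    for r in range(n):
        for s in range(n):
            row_sum = sum(m[s][c] * cofactor(m, r, c) for c in range(n))  # (5), (7)
            col_sum = sum(m[i][s] * cofactor(m, i, r) for i in range(n))  # (6), (8)
            expected = delta if r == s else 0
            if row_sum != expected or col_sum != expected:
                return False
    return True


m = [[2, 1, 3], [0, -1, 4], [5, 2, 1]]
print(det(m), check_theorem_10(m))
```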


3. Preface to §§ 4-6 


We come now to a group of problems that depend for their 
full discussion on the implications of ‘rank’ in a matrix. This 
full discussion is deferred to Chapter VIII. But even the more 
elementary aspects of these problems are of considerable 
importance and these are set out in §§ 4-6. 

4. The solution of non-homogeneous linear equations

4.1. The equations

\[ a_1x + b_1y + c_1z = 0, \quad a_2x + b_2y + c_2z = 0 \]

are said to be homogeneous linear equations in $x$, $y$, $z$; the equations

\[ a_1x + b_1y = c_1, \quad a_2x + b_2y = c_2 \]

are said to be non-homogeneous linear equations in $x$, $y$.

4.2. If it is possible to choose a set of values for $x, y, \dots, t$ so that the $m$ equations

\[ a_rx + b_ry + \dots + k_rt = l_r \quad (r = 1, 2, \dots, m) \]

are all satisfied, these equations are said to be consistent. If it is not possible so to choose the values of $x, y, \dots, t$, the equations are said to be inconsistent. For example, $x+y = 2$, $x-y = 0$, $3x-2y = 1$ are consistent, since all three equations are satisfied when $x = 1$, $y = 1$; on the other hand, $x+y = 2$, $x-y = 0$, $3x-2y = 6$ are inconsistent.

4.3. Consider the $n$ non-homogeneous linear equations, in the $n$ variables $x, y, \dots, t$,

\[ a_rx + b_ry + \dots + k_rt = l_r \quad (r = 1, 2, \dots, n). \tag{9} \]

Let $\Delta = (a_1 b_2 \dots k_n)$ and let $A_r, B_r, \dots$ be the co-factors of $a_r, b_r, \dots$ in $\Delta$.

Since, by Theorem 10, $\sum a_rA_r = \Delta$, $\sum b_rA_r = 0$, ..., the result of multiplying each equation (9) by its corresponding $A_r$ and adding is

\[ \Delta x + 0\cdot y + \dots + 0\cdot t = l_1A_1 + \dots + l_nA_n; \]

that is,

\[ \Delta x = (l_1 b_2 c_3 \dots k_n), \tag{10} \]

the determinant obtained by writing $l$ for $a$ in $\Delta$.

Similarly, the result of multiplying each equation (9) by its corresponding $B_r$ and adding is

\[ \Delta y = l_1B_1 + \dots + l_nB_n = (a_1 l_2 c_3 \dots k_n), \tag{11} \]

and so on.

When $\Delta \neq 0$ the equations (9) have a unique solution given by (10), (11), and their analogues. In words, the solution is '$\Delta\cdot x$ is equal to the determinant obtained by putting $l$ for $a$ in $\Delta$; $\Delta\cdot y$ is equal to the determinant obtained by putting $l$ for $b$ in $\Delta$; and so on'.†

When $\Delta = 0$ the non-homogeneous equations (9) are inconsistent unless each of the determinants on the right-hand sides of (10), (11), and their analogues is also zero. When all such determinants are zero the equations (9) may or may not be consistent: we defer consideration of the problem to a later chapter.

† Some readers may already be familiar with different forms of setting out this result; the differences of form are unimportant.



5. The solution of homogeneous linear equations

Theorem 11. A necessary and sufficient condition that values, not all zero, may be assigned to the $n$ variables $x, y, \dots, t$ so that the $n$ homogeneous equations

\[ a_rx + b_ry + \dots + k_rt = 0 \quad (r = 1, 2, \dots, n) \tag{12} \]

hold simultaneously is $(a_1 b_2 \dots k_n) = 0$.

5.1. Necessary. Let equations (12) be satisfied by values of $x, y, \dots, t$ not all zero. Let $\Delta = (a_1 b_2 \dots k_n)$ and let $A_r, B_r, \dots$ be the co-factors of $a_r, b_r, \dots$ in $\Delta$. Multiply each equation (12) by its corresponding $A_r$ and add; the result is, as in §4.3, $\Delta x = 0$. Similarly, $\Delta y = 0$, $\Delta z = 0$, ..., $\Delta t = 0$. But, by hypothesis, at least one of $x, y, \dots, t$ is not zero and therefore $\Delta$ must be zero.

5.2. Sufficient. Let $\Delta = 0$.

5.21. In the first place suppose, further, that $A_1 \neq 0$. Omit $r = 1$ from (12) and consider the $n-1$ equations

\[ b_ry + c_rz + \dots + k_rt = -a_rx \quad (r = 2, 3, \dots, n), \tag{13} \]

where the determinant of the coefficients on the left is $A_1$. Then, proceeding exactly as in §4.3, but with the determinant $A_1$ in place of $\Delta$, we obtain

\[ A_1y = -(a_2 c_3 \dots k_n)x = B_1x, \]

\[ A_1z = -(b_2 a_3 d_4 \dots k_n)x = C_1x, \]

and so on. Hence the set of values

\[ x = A_1\xi, \quad y = B_1\xi, \quad z = C_1\xi, \quad \dots, \]

where $\xi \neq 0$, is a set, not all zero (since $A_1\xi \neq 0$), satisfying the equations (13). But

\[ a_1A_1 + b_1B_1 + \dots + k_1K_1 = \Delta = 0, \]

and hence this set of values also satisfies the omitted equation corresponding to $r = 1$. This proves that $\Delta = 0$ is a sufficient condition for our result to hold provided also that $A_1 \neq 0$.

If $A_1 = 0$ and some other first minor, say $C_s$, is not zero, an interchange of the letters $a$ and $c$, of the letters $x$ and $z$, and of the suffixes 1 and $s$ will give the equations (12) in a slightly changed notation and, in this new notation, $A_1$ (the $C_s$ of the old notation) is not zero. It follows that if $\Delta = 0$ and if any one first minor of $\Delta$ is not zero, values, not all zero, may be assigned to $x, y, \dots, t$ so that equations (12) are all satisfied.
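The construction of §5.21 can be illustrated numerically. In this Python sketch (the function names are illustrative assumptions) the co-factors of a row of a vanishing determinant are returned as a solution of the homogeneous equations, the row being chosen so that at least one co-factor is not zero. The case in which every first minor vanishes is not handled here.

```python
def det(m):
    """Determinant by expansion along the first row."""
    if not m:
        return 1
    return sum(
        (-1) ** c * m[0][c] * det([row[:c] + row[c + 1:] for row in m[1:]])
        for c in range(len(m))
    )


def cofactor(m, r, c):
    """(-1)**(r+c) times the determinant left after deleting row r and column c."""
    sub = [row[:c] + row[c + 1:] for i, row in enumerate(m) if i != r]
    return (-1) ** (r + c) * det(sub)


def nontrivial_solution(m):
    """Co-factors of one row of m, when det(m) = 0 and they are not all zero."""
    if det(m) != 0:
        return None  # only x = y = ... = t = 0 satisfies the equations
    n = len(m)
    for r in range(n):
        candidate = [cofactor(m, r, c) for c in range(n)]
        if any(candidate):
            return candidate
    return None  # every first minor vanishes: not covered by this sketch


m = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
solution = nontrivial_solution(m)
print(solution)
# every equation of the set is satisfied by the values found
assert all(sum(a * x for a, x in zip(row, solution)) == 0 for row in m)
```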

5.22. Suppose now that $\Delta = 0$ and that every first minor of $\Delta$ is zero; or, proceeding to the general case, suppose that all minors with more than $R$ rows and columns vanish, but that at least one minor with $R$ rows and columns does not vanish. Change the notation (by interchanging letters and interchanging suffixes) so that one non-vanishing minor of $R$ rows is the first minor of $a_1$ in the determinant $(a_1 b_2 \dots e_{R+1})$, where $e$ denotes the $(R+1)$th letter of the alphabet.

Consider, instead of (12), the $R+1$ equations

\[ a_rx + b_ry + \dots + e_r\lambda = 0 \quad (r = 1, 2, \dots, R+1), \tag{12'} \]

where $\lambda$ denotes the $(R+1)$th variable of the set $x, y, \dots$ . The determinant $\Delta' = (a_1 b_2 \dots e_{R+1}) = 0$, by hypothesis, while the minor of $a_1$ in $\Delta'$ is not zero, also by hypothesis. Hence, by §5.21, the equations (12') are satisfied when

\[ x = A_1', \quad y = B_1', \quad \dots, \quad \lambda = E_1', \tag{14} \]

where $A_1', B_1', \dots$ are the co-factors of $a_1, b_1, \dots$ in $\Delta'$. Moreover, $A_1' \neq 0$.

Further, if $R+1 < r \leq n$,

\[ a_rA_1' + b_rB_1' + \dots + e_rE_1', \]

being the determinant formed by putting $a_r, b_r, \dots$ for $a_1, b_1, \dots$ in $\Delta'$, is a determinant of order $R+1$ formed from the coefficients of the equations (12); as such its value is, by our hypothesis, zero. Hence the values (14) satisfy not only (12'), but also

\[ a_rx + b_ry + \dots + e_r\lambda = 0 \quad (R+1 < r \leq n). \]

Hence the $n$ equations (12) are satisfied if we put the values (14) for $x, y, \dots, \lambda$ and the value zero for all variables in (12) other than these. Moreover, the value of $x$ in (14) is not zero.


6. The minors of a zero determinant

Theorem 12. If $\Delta = 0$,

\[ A_rB_s = A_sB_r, \quad A_rC_s = A_sC_r, \quad \dots . \]

If $\Delta = 0$ and if, further, no first minor of $\Delta$ is zero, the co-factors of the $r$-th row (column) are proportional to those of the $s$-th row (column); that is,

\[ \frac{A_r}{A_s} = \frac{B_r}{B_s} = \dots = \frac{K_r}{K_s}. \]

Consider the $n$ equations

\[ a_rx + b_ry + \dots + k_rt = 0 \quad (r = 1, 2, \dots, n). \tag{15} \]

They are satisfied by $x = A_1$, $y = B_1$, ..., $t = K_1$; for, by Theorem 10,

\[ a_1A_1 + b_1B_1 + \dots + k_1K_1 = \Delta = 0, \]

\[ a_rA_1 + b_rB_1 + \dots + k_rK_1 = 0 \quad (r = 2, 3, \dots, n). \]

Now let $s$ be any one of the numbers $2, 3, \dots, n$, and consider the $n-1$ equations

\[ b_ry + c_rz + \dots + k_rt = -a_rx \quad (r \neq s). \]

Proceeding as in §4.3, but with the determinant $A_s$ in place of $\Delta$, we obtain

\[ (b_1 c_2 \dots k_n)y = -(a_1 c_2 \dots k_n)x, \]

\[ (b_1 c_2 \dots k_n)z = -(b_1 a_2 d_3 \dots k_n)x, \]

and so on, there being no suffix $s$. That is, we have

\[ A_sy = B_sx, \quad A_sz = C_sx, \quad \dots . \]

These equations hold whenever $x, y, \dots, t$ satisfy the equations (15), and therefore hold when

\[ x = A_1, \quad y = B_1, \quad \dots, \quad t = K_1. \]

Hence

\[ A_sB_1 = A_1B_s, \quad A_sC_1 = A_1C_s, \quad \dots . \tag{16} \]

The same method of proof holds when we take

\[ x = A_r, \quad y = B_r, \quad \dots, \quad t = K_r, \]

with $r \neq 1$, as a solution of (15). Hence our general theorem is proved.

If no first minor is zero, we may divide the equation $A_rB_s = A_sB_r$ by $A_sB_s$, etc., and so obtain

\[ \frac{A_r}{A_s} = \frac{B_r}{B_s} = \dots = \frac{K_r}{K_s}. \tag{17} \]

Working with columns instead of rows, we have

\[ \frac{A_1}{B_1} = \frac{A_2}{B_2} = \dots = \frac{A_n}{B_n}, \]

and so for other columns.
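Theorem 12 says, in effect, that when $\Delta = 0$ every two-rowed determinant formed from the array of co-factors vanishes. The Python sketch below checks this on one vanishing determinant; the names `cofactor_matrix` and `rows_proportional` are illustrative assumptions.

```python
def det(m):
    """Determinant by expansion along the first row."""
    if not m:
        return 1
    return sum(
        (-1) ** c * m[0][c] * det([row[:c] + row[c + 1:] for row in m[1:]])
        for c in range(len(m))
    )


def cofactor_matrix(m):
    """Array whose (r, c) element is the co-factor of the (r, c) element of m."""
    n = len(m)
    return [
        [
            (-1) ** (r + c)
            * det([row[:c] + row[c + 1:] for i, row in enumerate(m) if i != r])
            for c in range(n)
        ]
        for r in range(n)
    ]


def rows_proportional(cof):
    """True when A_r B_s = A_s B_r for every pair of rows and every pair of letters."""
    n = len(cof)
    return all(
        cof[r][i] * cof[s][j] == cof[s][i] * cof[r][j]
        for r in range(n) for s in range(n)
        for i in range(n) for j in range(n)
    )


m = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]  # its determinant is zero
print(cofactor_matrix(m), rows_proportional(cofactor_matrix(m)))
```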


Examples III 

1. By solving the equations

\[ a^rx + b^ry + c^rz + d^rt = \alpha^r \quad (r = 0, 1, 2, 3), \]

prove that

\[ \sum_{a} \frac{(\alpha-b)(\alpha-c)(\alpha-d)}{(a-b)(a-c)(a-d)}\, a^r = \alpha^r. \]

Obtain identities in $a, b, c, d$ by considering the coefficients of powers of $\alpha$.


2. If $\Delta$ denotes the determinant

\[ \begin{vmatrix} a & h & g \\ h & b & f \\ g & f & c \end{vmatrix} \]

and $G$, $F$, $C$ are the co-factors of $g$, $f$, $c$ in $\Delta$, prove that

\[ aG^2 + 2hFG + bF^2 + 2gGC + 2fFC + cC^2 = C\Delta. \]

Hint. Use $aG + hF + gC = 0$, etc.

3. If the determinant $\Delta$ of Example 2 is equal to zero, prove that $BC = F^2$, $GH = AF$, ..., and hence that, when $C \neq 0$,

\[ a(ax^2 + 2hxy + by^2 + 2gx + 2fy + c) = (ax + hy + g)^2 + \frac{1}{C}(Cy - F)^2. \]

Prove also that, if $a, \dots, h$ are real numbers and $C$ is negative, then so also are $A$ and $B$. [$A = bc - f^2$, $F = gh - af$, etc.]

4. Prove that the three lines whose cartesian equations are

\[ a_rx + b_ry + c_r = 0 \quad (r = 1, 2, 3) \]

are concurrent or parallel if $(a_1 b_2 c_3) = 0$.

Hint. Make equations homogeneous and use Theorem 11.

5. Find the conditions that the four planes whose cartesian equations are $a_rx + b_ry + c_rz + d_r = 0$ ($r = 1, 2, 3, 4$) should have a finite point in common.

6. Prove that the equation of the circle cutting the three given circles

\[ x^2 + y^2 + 2g_rx + 2f_ry + c_r = 0 \quad (r = 1, 2, 3) \]

orthogonally is

\[ \begin{vmatrix} x^2+y^2 & -x & -y & 1 \\ c_1 & g_1 & f_1 & 1 \\ c_2 & g_2 & f_2 & 1 \\ c_3 & g_3 & f_3 & 1 \end{vmatrix} = 0. \]

By writing the last equation as

\[ \begin{vmatrix} c_1 + g_1x + f_1y & g_1 + x & f_1 + y \\ c_2 + g_2x + f_2y & g_2 + x & f_2 + y \\ c_3 + g_3x + f_3y & g_3 + x & f_3 + y \end{vmatrix} = 0, \]

prove that the circle in question is the locus of the point whose polars with respect to the three given circles are concurrent.

7. Express in determinantal form the condition that four given circles may have a common orthogonal circle.

8. Write down

\[ \begin{vmatrix} 1 & 1 & 1 & 1 \\ \alpha & \beta & \gamma & x \\ \alpha^2 & \beta^2 & \gamma^2 & x^2 \\ \alpha^3 & \beta^3 & \gamma^3 & x^3 \end{vmatrix} \]

as a product of factors, and, by considering the minors of $x$, $x^2$ in the determinant and the coefficients of $x$, $x^2$ in the product of the factors, evaluate the determinants of Examples 10, 11 on p. 27.

9. Extend the results of Example 8 to determinants of higher orders. 

7. Laplace's expansion of a determinant

7.1. The determinant

\[ \Delta_n = \begin{vmatrix} a_1 & b_1 & \dots & k_1 \\ \vdots & \vdots & & \vdots \\ a_n & b_n & \dots & k_n \end{vmatrix} \]

can be expressed in the form

\[ \sum (-1)^N a_r b_s c_t \dots k_\theta, \tag{18} \]

where the sum is taken over the $n!$ ways of assigning to $r, s, \dots, \theta$ the values $1, 2, \dots, n$ in some order and $N$ is the total number of inversions in the suffixes $r, s, \dots, \theta$.

We now show that the expansions of $\Delta_n$ by its rows and columns are but special cases of a more general procedure. The terms in (18) that contain $a_p b_q$, when $p$ and $q$ are fixed, constitute a sum

\[ a_p b_q (AB)_{pq} = a_p b_q \sum \pm c_t \dots k_\theta \tag{19} \]

in which the sum is taken over the $(n-2)!$ ways of assigning to $t, \dots, \theta$ the values $1, 2, \dots, n$, in some order, excluding $p$ and $q$. Also, an interchange of any two letters throughout (19) will reproduce the same set of terms, but with opposite signs prefixed, since this is true of (18). Hence, by the definition of a determinant, either $+(AB)_{pq}$ or $-(AB)_{pq}$ is equal to the determinant of order $n-2$ obtained by deleting from $\Delta_n$ the first and second columns and also the $p$th and $q$th rows. Denote this determinant by its leading diagonal, say $(c_t d_u \dots k_\theta)$, where $t, u, \dots, \theta$ are $1, 2, \dots, n$ in that order but excluding $p$ and $q$.

Again, since (18) contains a set of terms $a_p b_q (AB)_{pq}$ and an interchange of $a$ and $b$ leaves $(AB)_{pq}$ unaltered, it follows (from the definition of a determinant) that (18) also contains a set of terms $-a_q b_p (AB)_{pq}$. Thus (18) contains a set of terms

\[ (a_p b_q - a_q b_p)(AB)_{pq}. \]

But we have seen that $(AB)_{pq} = \pm(c_t d_u \dots k_\theta)$, and the determinant $a_p b_q - a_q b_p$ may be denoted by $(a_p b_q)$, so that a typical set of terms in (18) is

\[ \pm(a_p b_q)(c_t d_u \dots k_\theta). \]

Moreover, all the terms of (18) are accounted for if we take all possible pairs of numbers $p$, $q$ from $1, 2, \dots, n$. Hence

\[ \Delta_n = \sum \pm(a_p b_q)(c_t d_u \dots k_\theta), \]

where the sum is taken over all possible pairs $p$, $q$.

In this form the fixing of the sign is a simple matter, for the leading diagonal term $a_p b_q$ of the determinant $(a_p b_q)$ is prefixed by plus, as is the leading diagonal term $c_t d_u \dots k_\theta$ of the determinant $(c_t d_u \dots k_\theta)$. Hence, on comparison with (18),

\[ \Delta_n = \sum (-1)^N (a_p b_q)(c_t d_u \dots k_\theta), \tag{20} \]

where

(i) the sum is taken over all possible pairs $p$, $q$,

(ii) $t, u, \dots, \theta$ are $1, 2, \dots, n$ in that order but excluding $p$ and $q$,

(iii) $N$ is the total number of inversions of suffixes in $p, q, t, u, \dots, \theta$.

The sum (20) is called the expansion of $\Delta_n$ by its first two columns.
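The expansion (20) can be sketched in Python as follows. The names are illustrative assumptions; the pairs $p$, $q$ are taken with $p < q$, and the sign of each term is fixed, as in (iii), by counting the inversions in the suffixes $p, q, t, u, \dots, \theta$.

```python
from itertools import combinations


def det(m):
    """Determinant by expansion along the first row; an empty array counts as 1."""
    if not m:
        return 1
    return sum(
        (-1) ** c * m[0][c] * det([row[:c] + row[c + 1:] for row in m[1:]])
        for c in range(len(m))
    )


def inversions(seq):
    """Number of pairs that stand in the wrong order."""
    return sum(1 for i, j in combinations(range(len(seq)), 2) if seq[i] > seq[j])


def expand_by_first_two_columns(m):
    """Sum (20): (a_p b_q) times the complementary minor, signed by inversions."""
    n = len(m)
    total = 0
    for p, q in combinations(range(n), 2):
        first = m[p][0] * m[q][1] - m[q][0] * m[p][1]       # (a_p b_q)
        others = [r for r in range(n) if r not in (p, q)]   # t, u, ..., in order
        complement = det([m[r][2:] for r in others])        # (c_t d_u ... k_theta)
        total += (-1) ** inversions([p, q] + others) * first * complement
    return total


m = [[2, 0, 1, 3], [1, 4, 0, 2], [0, 1, 5, 1], [3, 2, 1, 0]]
print(expand_by_first_two_columns(m), det(m))
assert expand_by_first_two_columns(m) == det(m)
```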

7.2. When once the argument of §7.1 has been grasped it is intuitive that the argument extends to expansions such as

\[ \Delta_n = \sum (-1)^N (a_p b_q c_r)(d_s \dots k_\theta), \tag{21} \]

\[ \phantom{\Delta_n} = \sum (-1)^N (a_p b_q c_r d_s)(e_t \dots k_\theta), \tag{22} \]

and so on. We leave the elaboration to the reader.

The expansion (20) is called an expansion of $\Delta_n$ by second minors, (21) an expansion by third minors, (22) an expansion by fourth minors, and so on.

Various modifications of §7.1 are often used. In §7.1 we expanded $\Delta_n$ by its first two columns: we may, in fact, expand it by any two (or more) columns or rows. For example, $\Delta_n$ is equal to

\[ \begin{vmatrix} c_1 & d_1 & a_1 & b_1 & \dots & k_1 \\ c_2 & d_2 & a_2 & b_2 & \dots & k_2 \\ \vdots & \vdots & \vdots & \vdots & & \vdots \\ c_n & d_n & a_n & b_n & \dots & k_n \end{vmatrix}, \]

as we see by interchanging $a$ and $c$ and also $b$ and $d$. The Laplace expansion of the latter determinant by its first two columns may be considered as the Laplace expansion of $\Delta_n$ by its third and fourth columns.


7.3. Determination of sign in a Laplace expansion

The procedure of p. 29, §1.4, enables us to calculate with ease the sign appropriate to a given term of a Laplace expansion. Consider the determinant $\Delta = |a_{rs}|$, that is, the determinant having $a_{rs}$ as the element in its $r$th row and $s$th column, and its Laplace expansion by the $s_1$th and $s_2$th columns ($s_1 < s_2$). This expansion is of the form

\[ \sum \pm \begin{vmatrix} a_{r_1s_1} & a_{r_1s_2} \\ a_{r_2s_1} & a_{r_2s_2} \end{vmatrix} \times \Delta(r_1, r_2; s_1, s_2), \tag{23} \]

where (i) $\Delta(r_1, r_2; s_1, s_2)$ denotes the determinant obtained by deleting the $r_1$th and $r_2$th rows as also the $s_1$th and $s_2$th columns of $\Delta$, and (ii) the summation is taken over all possible pairs $r_1$, $r_2$ ($r_1 < r_2$).

Now, by §1.4 (p. 29), the terms in the expansion of $\Delta$ that involve $a_{r_1s_1}$ are given by

\[ (-1)^{r_1+s_1} a_{r_1s_1} \Delta(r_1; s_1), \]

where $\Delta(r_1; s_1)$ is the determinant obtained by deleting the $r_1$th row and $s_1$th column of $\Delta$. Moreover, since $r_1 < r_2$ and $s_1 < s_2$, the element $a_{r_2s_2}$ appears in the $(r_2-1)$th row and $(s_2-1)$th column of $\Delta(r_1; s_1)$; hence the terms in the expansion of $\Delta(r_1; s_1)$ that involve $a_{r_2s_2}$ are given by

\[ (-1)^{r_2+s_2-2} a_{r_2s_2} \Delta(r_1, r_2; s_1, s_2), \]

for $\Delta(r_1, r_2; s_1, s_2)$ is obtained from $\Delta(r_1; s_1)$ by deleting the row and column containing $a_{r_2s_2}$.

Thus the expansion of $\Delta$ contains the term

\[ (-1)^{r_1+r_2+s_1+s_2} a_{r_1s_1} a_{r_2s_2} \Delta(r_1, r_2; s_1, s_2), \]

and therefore the Laplace expansion of $\Delta$ contains the term

\[ (-1)^{r_1+r_2+s_1+s_2} \begin{vmatrix} a_{r_1s_1} & a_{r_1s_2} \\ a_{r_2s_1} & a_{r_2s_2} \end{vmatrix} \times \Delta(r_1, r_2; s_1, s_2). \tag{24} \]

That is to say, the sign to be prefixed to a term in the Laplace expansion is $(-1)^\sigma$, where $\sigma$ is the sum of the row and column numbers of the elements appearing in the first factor of the term.

The rule, proved above for expansions by second minors, easily extends to Laplace expansions by third, fourth, ..., minors.
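The sign rule $(-1)^{r_1+r_2+s_1+s_2}$ lends itself to a direct check. The following Python sketch, with illustrative names, expands a determinant by any two chosen columns; rows and columns are numbered from 0 inside the code and converted to the numbering from 1 used in the text when the sign is formed.

```python
from itertools import combinations


def det(m):
    """Determinant by expansion along the first row; an empty array counts as 1."""
    if not m:
        return 1
    return sum(
        (-1) ** c * m[0][c] * det([row[:c] + row[c + 1:] for row in m[1:]])
        for c in range(len(m))
    )


def expand_by_two_columns(m, s1, s2):
    """Laplace expansion by columns s1 < s2 (numbered from 0), signed as in the rule above."""
    n = len(m)
    other_cols = [c for c in range(n) if c not in (s1, s2)]
    total = 0
    for r1, r2 in combinations(range(n), 2):
        first = m[r1][s1] * m[r2][s2] - m[r1][s2] * m[r2][s1]
        complement = det(
            [[m[r][c] for c in other_cols] for r in range(n) if r not in (r1, r2)]
        )
        sigma = (r1 + 1) + (r2 + 1) + (s1 + 1) + (s2 + 1)  # numbering from 1
        total += (-1) ** sigma * first * complement
    return total


m = [[2, 0, 1, 3], [1, 4, 0, 2], [0, 1, 5, 1], [3, 2, 1, 0]]
for s1, s2 in combinations(range(4), 2):
    assert expand_by_two_columns(m, s1, s2) == det(m)
print(det(m))
```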

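7.4. As an exercise to ensure that the import of (20) has been grasped, the reader should check the following expansions of determinants.

(i)
\[ \begin{aligned}
\begin{vmatrix} a_1 & b_1 & c_1 & d_1 \\ a_2 & b_2 & c_2 & d_2 \\ a_3 & b_3 & c_3 & d_3 \\ a_4 & b_4 & c_4 & d_4 \end{vmatrix}
&= \begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix} \times \begin{vmatrix} c_3 & d_3 \\ c_4 & d_4 \end{vmatrix}
- \begin{vmatrix} a_1 & b_1 \\ a_3 & b_3 \end{vmatrix} \times \begin{vmatrix} c_2 & d_2 \\ c_4 & d_4 \end{vmatrix}
+ \begin{vmatrix} a_1 & b_1 \\ a_4 & b_4 \end{vmatrix} \times \begin{vmatrix} c_2 & d_2 \\ c_3 & d_3 \end{vmatrix} \\
&\quad + \begin{vmatrix} a_2 & b_2 \\ a_3 & b_3 \end{vmatrix} \times \begin{vmatrix} c_1 & d_1 \\ c_4 & d_4 \end{vmatrix}
- \begin{vmatrix} a_2 & b_2 \\ a_4 & b_4 \end{vmatrix} \times \begin{vmatrix} c_1 & d_1 \\ c_3 & d_3 \end{vmatrix}
+ \begin{vmatrix} a_3 & b_3 \\ a_4 & b_4 \end{vmatrix} \times \begin{vmatrix} c_1 & d_1 \\ c_2 & d_2 \end{vmatrix}.
\end{aligned} \]

The signs are, by (20), those of the leading diagonal term products

\[ a_1b_2c_3d_4, \quad a_1b_3c_2d_4, \quad a_1b_4c_2d_3, \quad \dots . \]

Alternatively, the signs are, by (24),

\[ (-1)^{1+2+1+2}, \quad (-1)^{1+3+1+2}, \quad (-1)^{1+4+1+2}, \quad \dots . \]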

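(ii)
\[ \begin{vmatrix} a_1 & b_1 & c_1 & d_1 & e_1 \\ 0 & 0 & c_2 & d_2 & e_2 \\ 0 & 0 & c_3 & d_3 & e_3 \\ 0 & 0 & c_4 & d_4 & e_4 \\ a_5 & b_5 & c_5 & d_5 & e_5 \end{vmatrix} = -(a_1 b_5)(c_2 d_3 e_4). \]

The sign is most easily fixed by (24), which gives it to be $(-1)^{1+5+1+2}$.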


(iii)
\[ 0 = \begin{vmatrix} a_1 & b_1 & c_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 & b_3 & c_3 \\ a_4 & b_4 & c_4 & b_4 & c_4 \\ 0 & 0 & 0 & b_4 & c_4 \end{vmatrix} = \Delta_1A_3^2 + \Delta_2A_1^3 - \Delta_3A_2^1, \]

where $\Delta_1 = (a_2 b_3 c_4)$, $\Delta_2 = (a_3 b_4 c_1)$, $\Delta_3 = (a_4 b_1 c_2)$, and $A_r^s$ is the co-factor of $a_r$ in $\Delta_s$.



THE PRODUCT OF TWO DETERMINANTS

1. The summation convention

It is an established convention that a sum such as

\[ \sum_{r=1}^{n} a_r b_r \tag{1} \]

may be denoted by the single term

\[ a_r b_r. \tag{2} \]

The convention is that when a literal suffix is repeated in the single term (as here $r$ is repeated) then the single term shall represent the sum of all the terms that correspond to different values of $r$. The convention is applicable only when, by the context, one knows the range of values of the suffix. In what follows we suppose that each suffix has the range of values $1, 2, \dots, n$.

Further examples of the convention are

\[ a_{rs}x_s, \quad \text{which denotes} \quad \sum_{s=1}^{n} a_{rs}x_s; \tag{3} \]

\[ a_{rj}b_{js}, \quad \text{which denotes} \quad \sum_{j=1}^{n} a_{rj}b_{js}; \tag{4} \]

\[ a_{rs}x_rx_s, \quad \text{which denotes} \quad \sum_{r=1}^{n}\sum_{s=1}^{n} a_{rs}x_rx_s. \tag{5} \]

In the last example both $r$ and $s$ are repeated and so we must sum with regard to both of them.

The repeated suffix is often called a 'dummy suffix', a curious, but almost universal, term for a suffix whose presence implies the summation. The nomenclature is appropriate because the meaning of the symbol as a whole does not depend on what letter is used for the 'dummy'; for example, both

\[ a_{rs}x_s \quad \text{and} \quad a_{rj}x_j \]

stand for the same thing, namely,

\[ a_{r1}x_1 + a_{r2}x_2 + \dots + a_{rn}x_n. \]

Any suffix that is not repeated is called a 'free suffix'. On most occasions when we are using the summation convention, we regard

\[ a_{rs}x_s \tag{6} \]

not as a single expression in which $r$ has a fixed value, but as a typical one of the $n$ expressions

\[ \sum_{s=1}^{n} a_{rs}x_s \quad (r = 1, 2, \dots, n). \]

Thus, in (6), $r$ is to be thought of as a suffix free to take any one of the values $1, 2, \dots, n$; and no further summation of all such expressions is implied.
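The distinction between dummy and free suffixes corresponds closely to the distinction between a loop variable and a function argument. The sketch below is an illustration only; the arrays `a`, `b`, `x` hold arbitrary numbers, the function names are assumptions, and suffixes run from 0 in the code where the text counts from 1.

```python
n = 3
a = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
b = [[1, 0, 2], [0, 1, 0], [3, 0, 1]]
x = [1, -1, 2]


def a_rs_x_s(r):
    """Expression (3): s is a dummy (summed), r is free (an argument)."""
    return sum(a[r][s] * x[s] for s in range(n))


def a_rj_b_js(r, s):
    """Expression (4): j is a dummy, r and s are both free."""
    return sum(a[r][j] * b[j][s] for j in range(n))


def a_rs_x_r_x_s():
    """Expression (5): both r and s are repeated, so nothing is left free."""
    return sum(a[r][s] * x[r] * x[s] for r in range(n) for s in range(n))


print([a_rs_x_s(r) for r in range(n)])  # one value for each value of the free suffix
print(a_rj_b_js(0, 0), a_rs_x_r_x_s())
```

Renaming the loop variable `s` to `j` inside `a_rs_x_s` would change nothing, which is exactly the remark made in the text about the letter used for the dummy.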

In the next section we use the convention in considering 
linear transformations of a set of n linear equations. 

2. Consider n variables x t (i = 1, 2,...,n) and the n linear formsf 

a nXi ( r = l,2,...,rc). 

When we substitute 

x t = K 

the forms (7) become 

a n b u x s (r = l,2,...,n). 

Now consider the n equations 


J i8 X S 


(i = 1,2 ,...,n), 


(V 

( 8 ) 

( 9 ) 


where 


C rs X * ~ 0 , 


C rs — a ri b ta- 


rn 

(H) 


If the determinant}: |c„| = 0, then (10) is satisfied by a set 
of values (t = 1,2 ,...,») which are not all zero (Theorem 11). 
But, when the X { satisfy equations (10), we have the n 

a ri x i = a ri b ia X a ~ C ra X a = 0 , 


( 12 ) 


equations 
and so 

either all the x i are zero, 

OR the determinant \a„\ is zero (Theorem 11). 

† The reader who is unfamiliar with the summation convention is recommended to write the next few lines in full and to pick out the coefficient of $X_s$.

‡ Compare Chap. I, §4 (p. 21).
In the former case the n equations

\[ b_{is} X_s = 0 \]

are satisfied by a set of values $X_i$ which are not all zero; that is, $|b_{rs}| = 0$.

Hence if $|c_{rs}| = 0$, at least one of the determinants $|a_{rs}|$, $|b_{rs}|$ is zero.

The determinant $|c_{rs}|$, i.e. $|a_{ri} b_{is}|$, is of degree n in the a’s and of degree n in the b’s, and so it is indicated that

\[ |c_{rs}| = k\,|a_{rs}| \times |b_{rs}|, \tag{13} \]

where k is independent of the a’s and b’s.

By considering the particular case $a_{rs} = 0$ when $r \neq s$, $a_{rr} = 1$, it is indicated that

\[ |c_{rs}| = |a_{rs}| \times |b_{rs}|. \tag{14} \]

The foregoing is an indication rather than a proof of the 
important theorem contained in (14). In the next section we 
give two proofs of this theorem. Of the two proofs, the second 
is perhaps the easier; it is certainly the more artificial. 
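Before the proofs, the indication in (14) can be checked numerically. The sketch below assumes small integer arrays chosen purely for illustration; `det` and `compose` are hypothetical helper names, and `det` uses a plain first-row expansion that is adequate only for small orders.

```python
def det(m):
    """Determinant by expansion along the first row (small orders only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def compose(a, b):
    """c_{rs} = a_{ri} b_{is}: coefficients after substituting x_i = b_{is} X_s."""
    n = len(a)
    return [[sum(a[r][i] * b[i][s] for i in range(n)) for s in range(n)]
            for r in range(n)]


a = [[2, 1, 0], [1, 3, -1], [0, 2, 4]]
b = [[1, 0, 2], [-1, 1, 1], [3, 2, 0]]
b_singular = [[1, 2, 3], [2, 4, 6], [0, 1, 1]]  # second row is twice the first

print(det(compose(a, b)), det(a) * det(b))   # the two values agree
print(det(compose(a, b_singular)))           # vanishes because |b| vanishes
```

A numerical agreement of this kind is, like the argument above, an indication rather than a proof.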

3. Proofs of the rule for multiplying determinants 

Theorem 13. Let $|a_{rs}|$, $|b_{rs}|$ be two determinants of order n; then their product is the determinant $|c_{rs}|$, where

\[ c_{rs} = \sum_{i=1}^{n} a_{ri} b_{is}. \]

3.1. First proof — a proof that uses the double suffix notation. 
In this proof, we shall use only the Greek letter α as a dummy suffix implying summation: a repeated Roman letter will not imply summation. Thus

$a_{1\alpha} b_{\alpha 1}$ stands for $a_{11} b_{11} + a_{12} b_{21} + \dots + a_{1n} b_{n1}$;

$a_{1r} b_{r1}$ stands for the single term; e.g. if r = 2, it stands for $a_{12} b_{21}$.

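Consider the determinant $|c_{pq}|$ written in the form

\[
\begin{vmatrix}
a_{1\alpha} b_{\alpha 1} & a_{1\alpha} b_{\alpha 2} & \cdots & a_{1\alpha} b_{\alpha n} \\
a_{2\alpha} b_{\alpha 1} & a_{2\alpha} b_{\alpha 2} & \cdots & a_{2\alpha} b_{\alpha n} \\
\vdots & \vdots & & \vdots \\
a_{n\alpha} b_{\alpha 1} & a_{n\alpha} b_{\alpha 2} & \cdots & a_{n\alpha} b_{\alpha n}
\end{vmatrix}
\]

and pick out the coefficient of

\[ a_{1r} a_{2s} a_{3t} \dots a_{nz} \tag{15} \]

in the expansion of this determinant. This we can do by considering all $a_{1x}$ other than $a_{1r}$ to be zero, all $a_{2x}$ other than $a_{2s}$ to be zero, and so on. Doing this, we see that the terms of $|c_{pq}|$ that contain (15) as a factor are given by†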


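\[
\begin{vmatrix}
a_{1r} b_{r1} & a_{1r} b_{r2} & \cdots & a_{1r} b_{rn} \\
a_{2s} b_{s1} & a_{2s} b_{s2} & \cdots & a_{2s} b_{sn} \\
\vdots & \vdots & & \vdots \\
a_{nz} b_{z1} & a_{nz} b_{z2} & \cdots & a_{nz} b_{zn}
\end{vmatrix}
= a_{1r} a_{2s} \dots a_{nz}
\begin{vmatrix}
b_{r1} & b_{r2} & \cdots & b_{rn} \\
b_{s1} & b_{s2} & \cdots & b_{sn} \\
\vdots & \vdots & & \vdots \\
b_{z1} & b_{z2} & \cdots & b_{zn}
\end{vmatrix}.
\]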


Unless the numbers r, s, ..., z are all different, the ‘b’ determinant is zero, having two rows identical. When r, s, ..., z are the numbers 1, 2, ..., n in some order, the ‘b’ determinant is equal to $(-1)^N |b_{pq}|$, where N is the total number of inversions of the suffixes in r, s, ..., z (p. 11, §2.2, applied to rows). Hence

\[ |c_{pq}| = \sum (-1)^N a_{1r} a_{2s} \dots a_{nz}\, |b_{pq}|, \tag{16} \]

where the summation is taken over the n! ways of assigning to r, s, ..., z the values 1, 2, ..., n in some order and N is the total number of inversions of the suffixes r, s, ..., z. But the factor multiplying $|b_{pq}|$ in (16) is merely the expanded form of $|a_{pq}|$, so that

\[ |c_{pq}| = |a_{pq}| \times |b_{pq}|. \]
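The two facts on which the first proof rests, the expansion of a determinant as a signed sum over arrangements and the factor $(-1)^N$ produced by rearranging rows, can be tried out directly. The sketch below is an illustration under assumed data: the array `b` and the names `inversions` and `expand` are hypothetical, and suffixes run from 0.

```python
from itertools import permutations


def inversions(p):
    """Number of pairs that appear out of their natural order in p."""
    return sum(1 for i in range(len(p)) for j in range(i + 1, len(p)) if p[i] > p[j])


def expand(m):
    """Sum of (-1)^N m[0][r] m[1][s] ... over all arrangements (r, s, ...) of the columns."""
    total = 0
    for p in permutations(range(len(m))):
        term = (-1) ** inversions(p)
        for row, col in enumerate(p):
            term *= m[row][col]
        total += term
    return total


b = [[1, 0, 2], [-1, 1, 1], [3, 2, 0]]

# Taking rows r, s, ..., z of b in place of rows 1, 2, ..., n multiplies |b| by (-1)^N.
row_rule_holds = all(
    expand([b[i] for i in p]) == (-1) ** inversions(p) * expand(b)
    for p in permutations(range(len(b)))
)
print(row_rule_holds)
```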


3.2. Second proof — a proof that uses a single suffix notation. 
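Consider, in the first place,

\[
\Delta = \begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix}, \qquad
\Delta' = \begin{vmatrix} \alpha_1 & \beta_1 \\ \alpha_2 & \beta_2 \end{vmatrix}.
\]

Then

\[
\Delta\Delta' = \begin{vmatrix}
a_1 & b_1 & 0 & 0 \\
a_2 & b_2 & 0 & 0 \\
-1 & 0 & \alpha_1 & \beta_1 \\
0 & -1 & \alpha_2 & \beta_2
\end{vmatrix}, \tag{17}
\]

as we see by using Laplace’s method and expanding the determinant by its last two columns.

† Note that $a_{1r} b_{r1}$ stands for a single term.

In the determinant (17) take

\[ r_1' = r_1 + a_1 r_3 + b_1 r_4, \qquad r_2' = r_2 + a_2 r_3 + b_2 r_4, \]

using the notation of p. 17, § 3.5. Then (17) becomes

\[
\Delta\Delta' = \begin{vmatrix}
0 & 0 & a_1\alpha_1 + b_1\alpha_2 & a_1\beta_1 + b_1\beta_2 \\
0 & 0 & a_2\alpha_1 + b_2\alpha_2 & a_2\beta_1 + b_2\beta_2 \\
-1 & 0 & \alpha_1 & \beta_1 \\
0 & -1 & \alpha_2 & \beta_2
\end{vmatrix}
\]

and so, on expanding the last determinant by its first two columns,

\[
\Delta\Delta' = \begin{vmatrix}
a_1\alpha_1 + b_1\alpha_2 & a_1\beta_1 + b_1\beta_2 \\
a_2\alpha_1 + b_2\alpha_2 & a_2\beta_1 + b_2\beta_2
\end{vmatrix}. \tag{18}
\]

The argument extends readily to determinants of order n. Let $\Delta = (a_1 b_2 \dots k_n)$, $\Delta' = (\alpha_1 \beta_2 \dots \kappa_n)$. The determinant

\[
\begin{vmatrix}
a_1 & \cdots & k_1 & 0 & \cdots & 0 \\
\vdots & & \vdots & \vdots & & \vdots \\
a_n & \cdots & k_n & 0 & \cdots & 0 \\
-1 & \cdots & 0 & \alpha_1 & \cdots & \kappa_1 \\
\vdots & & \vdots & \vdots & & \vdots \\
0 & \cdots & -1 & \alpha_n & \cdots & \kappa_n
\end{vmatrix},
\]

in which the bottom left-hand quarter has −1 in each element of its principal diagonal and 0 elsewhere, is unaltered if we write

\[
\begin{aligned}
r_1' &= r_1 + a_1 r_{n+1} + b_1 r_{n+2} + \dots + k_1 r_{2n}, \\
r_2' &= r_2 + a_2 r_{n+1} + b_2 r_{n+2} + \dots + k_2 r_{2n}, \\
&\;\;\vdots \\
r_n' &= r_n + a_n r_{n+1} + b_n r_{n+2} + \dots + k_n r_{2n}.
\end{aligned}
\]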

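Hence

\[
\Delta\Delta' = \begin{vmatrix}
0 & \cdots & 0 & a_1\alpha_1 + \dots + k_1\alpha_n & \cdots & a_1\kappa_1 + \dots + k_1\kappa_n \\
\vdots & & \vdots & \vdots & & \vdots \\
0 & \cdots & 0 & a_n\alpha_1 + \dots + k_n\alpha_n & \cdots & a_n\kappa_1 + \dots + k_n\kappa_n \\
-1 & \cdots & 0 & \alpha_1 & \cdots & \kappa_1 \\
\vdots & & \vdots & \vdots & & \vdots \\
0 & \cdots & -1 & \alpha_n & \cdots & \kappa_n
\end{vmatrix}.
\]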
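When we expand the last determinant by its first n columns, the only non-zero term is (for sign see p. 39, §7.3)

\[
(-1)^n
\begin{vmatrix}
-1 & 0 & \cdots & 0 \\
0 & -1 & \cdots & 0 \\
\vdots & \vdots & & \vdots \\
0 & 0 & \cdots & -1
\end{vmatrix}
\times
\begin{vmatrix}
a_1\alpha_1 + \dots + k_1\alpha_n & \cdots & a_1\kappa_1 + \dots + k_1\kappa_n \\
\vdots & & \vdots \\
a_n\alpha_1 + \dots + k_n\alpha_n & \cdots & a_n\kappa_1 + \dots + k_n\kappa_n
\end{vmatrix}
= (-1)^{2n}
\begin{vmatrix}
a_1\alpha_1 + \dots + k_1\alpha_n & \cdots & a_1\kappa_1 + \dots + k_1\kappa_n \\
\vdots & & \vdots \\
a_n\alpha_1 + \dots + k_n\alpha_n & \cdots & a_n\kappa_1 + \dots + k_n\kappa_n
\end{vmatrix}.
\]

Moreover, 2n is an even number and so the sign to be prefixed to the determinant is always the plus sign. Hence

\[
\begin{vmatrix}
a_1 & \cdots & k_1 \\
\vdots & & \vdots \\
a_n & \cdots & k_n
\end{vmatrix}
\times
\begin{vmatrix}
\alpha_1 & \cdots & \kappa_1 \\
\vdots & & \vdots \\
\alpha_n & \cdots & \kappa_n
\end{vmatrix}
=
\begin{vmatrix}
a_1\alpha_1 + \dots + k_1\alpha_n & \cdots & a_1\kappa_1 + \dots + k_1\kappa_n \\
\vdots & & \vdots \\
a_n\alpha_1 + \dots + k_n\alpha_n & \cdots & a_n\kappa_1 + \dots + k_n\kappa_n
\end{vmatrix}, \tag{19}
\]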

which is another way of stating Theorem 13. 
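The row operations of the second proof can be followed step by step for order two. The sketch below assumes illustrative integer entries; `bordered` and `reduce_top_rows` are hypothetical helper names, and `det` is a simple first-row expansion.

```python
def det(m):
    """Determinant by expansion along the first row (small orders only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def bordered(a, b):
    """The array of order 2n in (17): a and zeros on top, -1's on a diagonal and b below."""
    n = len(a)
    top = [a[r] + [0] * n for r in range(n)]
    bottom = [[-1 if c == r else 0 for c in range(n)] + b[r] for r in range(n)]
    return top + bottom


def reduce_top_rows(m, a):
    """Replace each of the first n rows r by r + a[r][0] r_{n+1} + ... + a[r][n-1] r_{2n}."""
    n = len(a)
    out = [row[:] for row in m]
    for r in range(n):
        for k in range(n):
            out[r] = [u + a[r][k] * v for u, v in zip(out[r], m[n + k])]
    return out


a = [[2, 1], [5, 3]]
b = [[4, -1], [2, 7]]
big = bordered(a, b)
reduced = reduce_top_rows(big, a)
product = [row[2:] for row in reduced[:2]]  # top right-hand quarter after the row operations

print(det(big), det(a) * det(b), det(product))
```

After the operations the top left-hand quarter consists of zeros and the top right-hand quarter holds the rows-by-columns product, as in the displayed determinants.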


4. Other ways of multiplying determinants 

The rule contained in Theorem 13 is sometimes called the
matrix† rule or the rule for multiplication of rows by columns. In
forming the first row of the determinant on the right of (19) we 
multiply the elements of the first row of A by the elements 
of the successive columns of A'; in forming the second row 
of the determinant we multiply the elements of the second 
row of A by the elements of the successive columns of A'. The 
process is most easily fixed in the mind by considering (18). 

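A determinant is unaltered in value if rows and columns are interchanged, and so we can at once deduce from (18) that, if

\[
\Delta = \begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix}, \qquad
\Delta' = \begin{vmatrix} \alpha_1 & \beta_1 \\ \alpha_2 & \beta_2 \end{vmatrix},
\]

then (18) may also be written in the forms

\[
\Delta\Delta' = \begin{vmatrix}
a_1\alpha_1 + b_1\beta_1 & a_1\alpha_2 + b_1\beta_2 \\
a_2\alpha_1 + b_2\beta_1 & a_2\alpha_2 + b_2\beta_2
\end{vmatrix}, \tag{20}
\]

\[
\Delta\Delta' = \begin{vmatrix}
a_1\alpha_1 + a_2\alpha_2 & b_1\alpha_1 + b_2\alpha_2 \\
a_1\beta_1 + a_2\beta_2 & b_1\beta_1 + b_2\beta_2
\end{vmatrix}. \tag{21}
\]

† See Chapter VI.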
In (20) the elements of the rows of Δ are multiplied by the
elements of the rows of Δ'; in (21) the elements of the columns
of Δ are multiplied by the elements of the columns of Δ'. The
first of these is referred to as multiplication by rows, the second
as multiplication by columns.

The extension to determinants of order n is immediate: 
several examples of the process occur later in the book. In the 
examples that follow multiplication is by rows: this is a matter 
of taste, and many writers use multiplication by columns or by 
the matrix rule. There is, however, much economy of thought if 
one consistently uses the same method whenever possible. 
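The three ways of forming the product give different arrays but the same determinant. A short sketch, with illustrative entries and hypothetical function names:

```python
def det(m):
    """Determinant by expansion along the first row (small orders only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def rows_by_columns(a, b):
    """The matrix rule: row r of a against column s of b."""
    n = len(a)
    return [[sum(a[r][k] * b[k][s] for k in range(n)) for s in range(n)] for r in range(n)]


def rows_by_rows(a, b):
    """Row r of a against row s of b."""
    n = len(a)
    return [[sum(a[r][k] * b[s][k] for k in range(n)) for s in range(n)] for r in range(n)]


def columns_by_columns(a, b):
    """Column r of a against column s of b."""
    n = len(a)
    return [[sum(a[k][r] * b[k][s] for k in range(n)) for s in range(n)] for r in range(n)]


a = [[2, 1, 0], [1, 3, -1], [0, 2, 4]]
b = [[1, 0, 2], [-1, 1, 1], [3, 2, 0]]
values = [det(f(a, b)) for f in (rows_by_columns, rows_by_rows, columns_by_columns)]
print(values, det(a) * det(b))
```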

Examples IV 

A number of interesting results can be obtained by applying the rules 
for multiplying determinants to particular examples. We shall arrange 
the examples in groups and we shall indicate the method of solution 
for at least one example in each group.

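1.
\[
\begin{vmatrix}
0 & (\alpha-\beta)^2 & (\alpha-\gamma)^2 \\
(\beta-\alpha)^2 & 0 & (\beta-\gamma)^2 \\
(\gamma-\alpha)^2 & (\gamma-\beta)^2 & 0
\end{vmatrix}
= 2\{(\beta-\gamma)(\gamma-\alpha)(\alpha-\beta)\}^2.
\]

The determinant is the product by rows of the two determinants

\[
\begin{vmatrix}
\alpha^2 & -2\alpha & 1 \\
\beta^2 & -2\beta & 1 \\
\gamma^2 & -2\gamma & 1
\end{vmatrix},
\qquad
\begin{vmatrix}
1 & \alpha & \alpha^2 \\
1 & \beta & \beta^2 \\
1 & \gamma & \gamma^2
\end{vmatrix}.
\]

By Theorem 8, the first of these is equal to $2(\beta-\gamma)(\gamma-\alpha)(\alpha-\beta)$ and the second to $(\beta-\gamma)(\gamma-\alpha)(\alpha-\beta)$.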

[In applying Theorem 8 we write down the product of the differences 
and adjust the numerical constant by considering the diagonal term of 
the determinant.] 

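2. The determinant of Example 3, p. 19, is the product by rows of the two determinants

\[
\begin{vmatrix}
\alpha^2 & -2\alpha & 1 & 0 \\
\beta^2 & -2\beta & 1 & 0 \\
\gamma^2 & -2\gamma & 1 & 0 \\
\delta^2 & -2\delta & 1 & 0
\end{vmatrix},
\qquad
\begin{vmatrix}
1 & \alpha & \alpha^2 & 0 \\
1 & \beta & \beta^2 & 0 \\
1 & \gamma & \gamma^2 & 0 \\
1 & \delta & \delta^2 & 0
\end{vmatrix}
\]

and so is zero.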

3. Prove that the determinant of order n which has $(a_r - a_s)^2$ as the element in the rth row and sth column is zero when n > 3.

4. Evaluate the determinant of order n which has $(a_r - a_s)^3$ as the element in the rth row and sth column: (i) when n = 4, (ii) when n > 4.

5. Extend the results of Examples 3 and 4 to higher powers of $a_r - a_s$.

6. (Harder.) Prove that

\[
\begin{vmatrix}
(a-x)^3 & (a-y)^3 & (a-z)^3 & a^3 \\
(b-x)^3 & (b-y)^3 & (b-z)^3 & b^3 \\
(c-x)^3 & (c-y)^3 & (c-z)^3 & c^3 \\
x^3 & y^3 & z^3 & 0
\end{vmatrix}
= -9abcxyz(b-c)(c-a)(a-b)(y-z)(z-x)(x-y).
\]
7. {Harder.) Prove that 

(a—x) 2 — 2/) a (o-z) a 

(6— x) 2 (b—y) 2 ( b — z ) 2 

(c—x) 2 (c y) 2 ( c—z ) 2 c 

a* 2/ fc 

is zero when k = 1, 2 and evaluate it when k = 0 and when k > 3.


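8. Prove that, if $s_r = \alpha^r + \beta^r + \gamma^r + \delta^r$, then

\[
\begin{vmatrix}
s_0 & s_1 & s_2 & s_3 \\
s_1 & s_2 & s_3 & s_4 \\
s_2 & s_3 & s_4 & s_5 \\
s_3 & s_4 & s_5 & s_6
\end{vmatrix}
=
\begin{vmatrix}
1 & 1 & 1 & 1 \\
\alpha & \beta & \gamma & \delta \\
\alpha^2 & \beta^2 & \gamma^2 & \delta^2 \\
\alpha^3 & \beta^3 & \gamma^3 & \delta^3
\end{vmatrix}^{2}.
\]

Hence (by Theorem 8) prove that the first determinant is equal to the product of the squares of the differences of α, β, γ, δ; i.e. $\{\zeta(\alpha, \beta, \gamma, \delta)\}^2$.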

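9. Prove that $(x-\alpha)(x-\beta)(x-\gamma)(x-\delta)$ is a factor of

\[
\Delta = \begin{vmatrix}
s_0 & s_1 & s_2 & s_3 & s_4 \\
s_1 & s_2 & s_3 & s_4 & s_5 \\
s_2 & s_3 & s_4 & s_5 & s_6 \\
s_3 & s_4 & s_5 & s_6 & s_7 \\
1 & x & x^2 & x^3 & x^4
\end{vmatrix}
\]

and find the other factor.

Solution. The arrangement of the elements $s_r$ in Δ indicates the product by rows (vide Example 8) of

\[
\Delta_1 = \begin{vmatrix}
1 & 1 & 1 & 1 & 0 \\
\alpha & \beta & \gamma & \delta & 0 \\
\alpha^2 & \beta^2 & \gamma^2 & \delta^2 & 0 \\
\alpha^3 & \beta^3 & \gamma^3 & \delta^3 & 0 \\
? & ? & ? & ? & ?
\end{vmatrix},
\qquad
\Delta_2 = \begin{vmatrix}
1 & 1 & 1 & 1 & ? \\
\alpha & \beta & \gamma & \delta & ? \\
\alpha^2 & \beta^2 & \gamma^2 & \delta^2 & ? \\
\alpha^3 & \beta^3 & \gamma^3 & \delta^3 & ? \\
\alpha^4 & \beta^4 & \gamma^4 & \delta^4 & ?
\end{vmatrix}.
\]

If we have anything but 0’s in the first four places of the last row of $\Delta_1$, then we shall get unwanted terms in the last row of the product $\Delta_1 \Delta_2$. So we try 0, 0, 0, 0, 1 as the last row of $\Delta_1$ and then it is not hard to see that, in order to give $\Delta_1 \Delta_2 = \Delta$, the last column of $\Delta_2$ must be 1, x, $x^2$, $x^3$, $x^4$.

The factors of Δ follow from Theorem 8.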
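10. Prove that, when $s_r = \alpha^r + \beta^r + \gamma^r$,

\[
\begin{vmatrix}
1 & 1 & 1 \\
\alpha & \beta & \gamma \\
\alpha^3 & \beta^3 & \gamma^3
\end{vmatrix}
\times
\begin{vmatrix}
1 & 1 & 1 \\
\alpha & \beta & \gamma \\
\alpha^2 & \beta^2 & \gamma^2
\end{vmatrix}
=
\begin{vmatrix}
s_0 & s_1 & s_2 \\
s_1 & s_2 & s_3 \\
s_3 & s_4 & s_5
\end{vmatrix}.
\]

Note how the sequence of indices in the columns of the first determinant affects the sequence of suffixes in the columns of the last determinant.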


11- Find the factors of 

*0 9 1 a 2 ^4 * 
8 2 8 t 

*6 $8 


where 8 r — a f -f /? r +y r . 


^3 


a* 


12. Extend the results of Examples 8-11 to determinants of higher 
orders. 


13. Multiply the determinants of the fourth order whose rows are given by

\[ x_r^2 + y_r^2, \quad -2x_r, \quad -2y_r, \quad 1; \qquad 1, \quad x_r, \quad y_r, \quad x_r^2 + y_r^2 \]

and r = 1, 2, 3, 4. Hence prove that the determinant $|a_{rs}|$, where $a_{rs} = (x_r - x_s)^2 + (y_r - y_s)^2$, is zero whenever the four points $(x_r, y_r)$ are concyclic.

14. Find a relation between the mutual distances of five points on 
a sphere. 

15. By considering the product of two determinants of the form

\[
\begin{vmatrix}
a + ib & c + id \\
-(c - id) & a - ib
\end{vmatrix}
\]

prove that the product of a sum of four squares by a sum of four squares is itself a sum of four squares.

16. Express $a^3 + b^3 + c^3 - 3abc$ as a circulant (p. 23, §5.2) of order three and hence prove that $(a^3 + b^3 + c^3 - 3abc)(A^3 + B^3 + C^3 - 3ABC)$ may be written in the form $X^3 + Y^3 + Z^3 - 3XYZ$.

Prove, further, that if $A = a^2 - bc$, $B = b^2 - ca$, $C = c^2 - ab$, then

\[ A^3 + B^3 + C^3 - 3ABC = (a^3 + b^3 + c^3 - 3abc)^2. \]


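17. If X = ax + hy, Y = hx + by; $(x_1, y_1)$, $(x_2, y_2)$ two sets of values of (x, y); $S_{11} = x_1 X_1 + y_1 Y_1$, $S_{12} = x_1 X_2 + y_1 Y_2$, etc.; prove that $S_{12} = S_{21}$ and that

\[
\begin{vmatrix} S_{11} & S_{12} \\ S_{21} & S_{22} \end{vmatrix}
=
\begin{vmatrix} a & h \\ h & b \end{vmatrix}
\times
\begin{vmatrix} x_1 & y_1 \\ x_2 & y_2 \end{vmatrix}^{2}.
\]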
18a. (Harder.) Using the summation convention, let $S \equiv a_{rs} x^r x^s$, where $x^1, x^2, \dots, x^n$ are n independent variables (and not powers of x), be a given quadratic form in n variables. Let $X_{r\lambda} = a_{rs} x^s_\lambda$; $S_{\lambda\mu} = x^r_\lambda X_{r\mu}$, where $(x^1_\lambda, \dots, x^n_\lambda)$, λ = 1, ..., n, denotes n sets of values of the variables $x^1, x^2, \dots, x^n$. Prove that

\[ |S_{\lambda\mu}| = |a_{rs}| \times |x^r_\lambda|^2. \]

18b. (Easier.) Extend the result of Example 17 to three variables (x, y, z) and the coordinates of three points in space, $(x_1, y_1, z_1)$, $(x_2, y_2, z_2)$, $(x_3, y_3, z_3)$.


5. Multiplication of arrays 
5.1. First take two arrays

\[
\begin{matrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \end{matrix}
\qquad\qquad
\begin{matrix} \alpha_1 & \beta_1 & \gamma_1 \\ \alpha_2 & \beta_2 & \gamma_2 \end{matrix}
\tag{A}
\]

in which the number of columns exceeds the number of rows, and multiply them by rows in the manner of multiplying determinants. The result is the determinant

\[
\Delta = \begin{vmatrix}
a_1\alpha_1 + b_1\beta_1 + c_1\gamma_1 & a_1\alpha_2 + b_1\beta_2 + c_1\gamma_2 \\
a_2\alpha_1 + b_2\beta_1 + c_2\gamma_1 & a_2\alpha_2 + b_2\beta_2 + c_2\gamma_2
\end{vmatrix}. \tag{1}
\]

If we expand Δ and pick out the terms involving, say, $\beta_1\gamma_2$ we see that they are $\beta_1\gamma_2(b_1 c_2 - b_2 c_1)$. Similarly, the terms in $\beta_2\gamma_1$ are $-\beta_2\gamma_1(b_1 c_2 - b_2 c_1)$. Hence Δ contains a term

\[ (b_1 c_2) \times (\beta_1 \gamma_2), \]

where $(b_1 c_2)$ denotes the determinant formed by the last two columns of the first array in (A) and $(\beta_1 \gamma_2)$ the corresponding determinant formed from the second array.

It follows that Δ, when expanded, contains terms

\[ (b_1 c_2)(\beta_1 \gamma_2) + (c_1 a_2)(\gamma_1 \alpha_2) + (a_1 b_2)(\alpha_1 \beta_2); \tag{2} \]

moreover (2) accounts for all terms that can possibly arise from the expansion of Δ. Hence Δ is equal to the sum of terms in (2); that is, the sum of all the products of corresponding determinants of order two that can be formed from the arrays in (A).

Now consider a, b, ..., k, ..., t, supposing k to be the nth letter and t the Nth, where n < N; consider also a corresponding notation in Greek letters.
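The determinant, of order n, that is obtained on multiplying by rows the two arrays

\[
\begin{matrix}
a_1 & \cdots & k_1 & \cdots & t_1 \\
\vdots & & \vdots & & \vdots \\
a_n & \cdots & k_n & \cdots & t_n
\end{matrix}
\qquad\qquad
\begin{matrix}
\alpha_1 & \cdots & \kappa_1 & \cdots & \tau_1 \\
\vdots & & \vdots & & \vdots \\
\alpha_n & \cdots & \kappa_n & \cdots & \tau_n
\end{matrix}
\]

is

\[
\Delta = \begin{vmatrix}
a_1\alpha_1 + \dots + t_1\tau_1 & \cdots & a_1\alpha_n + \dots + t_1\tau_n \\
\vdots & & \vdots \\
a_n\alpha_1 + \dots + t_n\tau_1 & \cdots & a_n\alpha_n + \dots + t_n\tau_n
\end{vmatrix}.
\]

This determinant, when expanded, contains a term (put all letters after k, the nth letter, equal to zero)

\[
\begin{vmatrix}
a_1\alpha_1 + \dots + k_1\kappa_1 & \cdots & a_1\alpha_n + \dots + k_1\kappa_n \\
\vdots & & \vdots \\
a_n\alpha_1 + \dots + k_n\kappa_1 & \cdots & a_n\alpha_n + \dots + k_n\kappa_n
\end{vmatrix} \tag{4}
\]

which is the product of the two determinants $(a_1 b_2 \dots k_n)$ and $(\alpha_1 \beta_2 \dots \kappa_n)$. Moreover (4) includes every term in the expansion of Δ containing the letters a, b, ..., k and no other roman letter. Thus the full expansion of Δ consists of a sum of such products, the number of such products being ${}^N C_n$, the number of ways of choosing n distinct letters from N given letters. We have therefore proved the following theorem.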

Theorem 14. The determinant Δ, of order n, which is obtained
on multiplying by rows two arrays that have N columns and n
rows, where n < N, is the sum of all the products of corresponding
determinants of order n that can be formed from the two arrays.
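Theorem 14 is easy to test on a small case. The sketch below assumes two illustrative arrays with n = 2 rows and N = 4 columns; `by_rows` and `columns` are hypothetical helper names.

```python
from itertools import combinations


def det(m):
    """Determinant by expansion along the first row (small orders only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def by_rows(p, q):
    """Multiply two arrays of the same shape by rows; the result is square."""
    return [[sum(u * v for u, v in zip(row_p, row_q)) for row_q in q] for row_p in p]


def columns(m, cols):
    """The square array formed from the chosen columns of m."""
    return [[row[c] for c in cols] for row in m]


p = [[1, 2, 3, 0], [0, 1, -1, 2]]
q = [[2, 0, 1, 1], [1, 1, 0, -3]]
n, N = len(p), len(p[0])

lhs = det(by_rows(p, q))
rhs = sum(det(columns(p, c)) * det(columns(q, c)) for c in combinations(range(N), n))
print(lhs, rhs)
```

The sum on the right runs over the six ways of choosing two columns from four, in agreement with the count given above.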

5.2. Theorem 15. The determinant of order n which is obtained on multiplying by rows two arrays that have N columns and n rows, where n > N, is equal to zero.

The determinant so formed is, in fact, the product by rows of the two zero determinants of order n that one obtains on adding (n − N) columns of ciphers to each array. For example,

\[
\begin{vmatrix}
a_1\alpha_1 + b_1\beta_1 & a_1\alpha_2 + b_1\beta_2 & a_1\alpha_3 + b_1\beta_3 \\
a_2\alpha_1 + b_2\beta_1 & a_2\alpha_2 + b_2\beta_2 & a_2\alpha_3 + b_2\beta_3 \\
a_3\alpha_1 + b_3\beta_1 & a_3\alpha_2 + b_3\beta_2 & a_3\alpha_3 + b_3\beta_3
\end{vmatrix}
\]

is obtained on multiplying by rows the two arrays

\[
\begin{matrix} a_1 & b_1 \\ a_2 & b_2 \\ a_3 & b_3 \end{matrix}
\qquad\qquad
\begin{matrix} \alpha_1 & \beta_1 \\ \alpha_2 & \beta_2 \\ \alpha_3 & \beta_3 \end{matrix}
\]

and is also obtained on multiplying by rows the two zero determinants

\[
\begin{vmatrix} a_1 & b_1 & 0 \\ a_2 & b_2 & 0 \\ a_3 & b_3 & 0 \end{vmatrix},
\qquad
\begin{vmatrix} \alpha_1 & \beta_1 & 0 \\ \alpha_2 & \beta_2 & 0 \\ \alpha_3 & \beta_3 & 0 \end{vmatrix}.
\]
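The padding argument can be reproduced numerically. In this sketch the two arrays (three rows, two columns) are illustrative assumptions and the helper names are hypothetical.

```python
def det(m):
    """Determinant by expansion along the first row (small orders only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def by_rows(p, q):
    """Multiply two arrays of the same shape by rows; the result is square."""
    return [[sum(u * v for u, v in zip(row_p, row_q)) for row_q in q] for row_p in p]


p = [[1, 2], [3, -1], [0, 4]]   # n = 3 rows, N = 2 columns
q = [[2, 5], [1, 1], [-3, 2]]

# Add one column of ciphers to each array to obtain two zero determinants of order 3.
padded_p = [row + [0] for row in p]
padded_q = [row + [0] for row in q]

print(det(by_rows(p, q)), det(padded_p), det(padded_q))
```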

Examples V 

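1. The arrays

\[
\begin{matrix}
1 & 0 & 0 & 0 \\
x_1^2 + y_1^2 & -2x_1 & -2y_1 & 1 \\
\vdots & \vdots & \vdots & \vdots \\
x_4^2 + y_4^2 & -2x_4 & -2y_4 & 1
\end{matrix}
\qquad\qquad
\begin{matrix}
0 & 0 & 0 & 1 \\
1 & x_1 & y_1 & x_1^2 + y_1^2 \\
\vdots & \vdots & \vdots & \vdots \\
1 & x_4 & y_4 & x_4^2 + y_4^2
\end{matrix}
\]

have 5 rows and 4 columns. By Theorem 15, the determinant obtained on multiplying them by rows vanishes identically. Show that this result gives a relation connecting the mutual distances of any four points in a plane.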

2. Obtain the corresponding relation for five points in space. 

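3. If α, β, γ, ... are the roots of an equation of degree n, and if $s_r$ denotes the sum of the rth powers of the roots, prove that

\[
\begin{vmatrix} s_0 & s_1 \\ s_1 & s_2 \end{vmatrix} = \sum (\alpha - \beta)^2,
\]

\[
\begin{vmatrix} s_0 & s_1 & s_2 \\ s_1 & s_2 & s_3 \\ s_2 & s_3 & s_4 \end{vmatrix}
= \sum (\beta - \gamma)^2 (\gamma - \alpha)^2 (\alpha - \beta)^2.
\]

Hint. Compare Example 8, p. 48, and Theorem 14.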

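4. Obtain identities by evaluating (in two ways) the determinant obtained on squaring (by rows) the arrays

\[
\text{(i)}\quad \begin{matrix} a & b & c \\ a' & b' & c' \end{matrix}
\qquad\qquad
\text{(ii)}\quad \begin{matrix} a & b & c & d \\ a' & b' & c' & d' \end{matrix}
\]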

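5. In the notation usually adopted for conics, let

\[ S \equiv ax^2 + \dots + 2fyz + \dots, \]

\[ X \equiv ax + hy + gz, \quad Y \equiv hx + by + fz, \quad Z \equiv gx + fy + cz, \]

\[ S_{rs} = x_r X_s + y_r Y_s + z_r Z_s; \]

let A, B, ..., H be the co-factors of a, b, ..., h in the discriminant of S.

Let $\xi = y_1 z_2 - y_2 z_1$, $\eta = z_1 x_2 - z_2 x_1$, $\zeta = x_1 y_2 - x_2 y_1$. Then

\[
\begin{vmatrix} S_{11} & S_{12} \\ S_{21} & S_{22} \end{vmatrix}
= A\xi^2 + B\eta^2 + C\zeta^2 + 2F\eta\zeta + 2G\zeta\xi + 2H\xi\eta.
\]

Solution. The determinant is

\[
\begin{vmatrix}
x_1 X_1 + \dots + z_1 Z_1 & x_1 X_2 + \dots + z_1 Z_2 \\
x_2 X_1 + \dots + z_2 Z_1 & x_2 X_2 + \dots + z_2 Z_2
\end{vmatrix}
\]

which is the product of arrays

\[
\begin{matrix} x_1 & y_1 & z_1 \\ x_2 & y_2 & z_2 \end{matrix}
\qquad\qquad
\begin{matrix} X_1 & Y_1 & Z_1 \\ X_2 & Y_2 & Z_2 \end{matrix}
\]

and so is $\sum (y_1 z_2)(Y_1 Z_2)$.

But $(Y_1 Z_2)$, when written in full, is

\[
\begin{vmatrix}
hx_1 + by_1 + fz_1 & gx_1 + fy_1 + cz_1 \\
hx_2 + by_2 + fz_2 & gx_2 + fy_2 + cz_2
\end{vmatrix}
\]

which is the product of arrays

\[
\begin{matrix} x_1 & y_1 & z_1 \\ x_2 & y_2 & z_2 \end{matrix}
\qquad\qquad
\begin{matrix} h & b & f \\ g & f & c \end{matrix}
\]

and so is $A\xi + H\eta + G\zeta$.

Hence the initial determinant is the sum of three terms of the type $\xi(A\xi + H\eta + G\zeta)$.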

6. The multiplication of determinants of different orders 

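It is sometimes useful to be able to write down the product of a determinant of order n by a determinant of order m. The method† of doing so is sufficiently exemplified by considering the product of $(a_1 b_2 c_3)$ by $(\alpha_1 \beta_2)$,

\[
\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}
\times
\begin{vmatrix} \alpha_1 & \beta_1 \\ \alpha_2 & \beta_2 \end{vmatrix}
=
\begin{vmatrix}
a_1\alpha_1 + b_1\beta_1 & a_1\alpha_2 + b_1\beta_2 & c_1 \\
a_2\alpha_1 + b_2\beta_1 & a_2\alpha_2 + b_2\beta_2 & c_2 \\
a_3\alpha_1 + b_3\beta_1 & a_3\alpha_2 + b_3\beta_2 & c_3
\end{vmatrix}.
\]

We see that the result is true by considering the first and last determinants when expanded by their last columns. The first member of the above equation is

\[ \Bigl\{ \sum \pm c_r (a_s b_t) \Bigr\} \times (\alpha_1 \beta_2), \]

while the second is, by the rule for multiplying two determinants of the same order,

\[ \sum \pm c_r (a_s b_t) \times (\alpha_1 \beta_2), \]

the signs having the same arrangement in both summations.

† I learnt this method from Professor A. L. Dixon.

A further example is

\[
\begin{vmatrix}
a_1 & b_1 & c_1 & d_1 \\
a_2 & b_2 & c_2 & d_2 \\
a_3 & b_3 & c_3 & d_3 \\
a_4 & b_4 & c_4 & d_4
\end{vmatrix}
\times
\begin{vmatrix} \alpha_1 & \beta_1 \\ \alpha_2 & \beta_2 \end{vmatrix}
=
\begin{vmatrix}
a_1\alpha_1 + b_1\beta_1 & a_1\alpha_2 + b_1\beta_2 & c_1 & d_1 \\
a_2\alpha_1 + b_2\beta_1 & a_2\alpha_2 + b_2\beta_2 & c_2 & d_2 \\
a_3\alpha_1 + b_3\beta_1 & a_3\alpha_2 + b_3\beta_2 & c_3 & d_3 \\
a_4\alpha_1 + b_4\beta_1 & a_4\alpha_2 + b_4\beta_2 & c_4 & d_4
\end{vmatrix},
\]

a result which becomes evident on considering the Laplace expansion of the last determinant by its third and fourth columns.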

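The reader may prefer the following method:

\[
\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}
\times
\begin{vmatrix} \alpha_1 & \beta_1 \\ \alpha_2 & \beta_2 \end{vmatrix}
=
\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}
\times
\begin{vmatrix} \alpha_1 & \beta_1 & 0 \\ \alpha_2 & \beta_2 & 0 \\ 0 & 0 & 1 \end{vmatrix}
=
\begin{vmatrix}
a_1\alpha_1 + b_1\beta_1 & a_1\alpha_2 + b_1\beta_2 & c_1 \\
a_2\alpha_1 + b_2\beta_1 & a_2\alpha_2 + b_2\beta_2 & c_2 \\
a_3\alpha_1 + b_3\beta_1 & a_3\alpha_2 + b_3\beta_2 & c_3
\end{vmatrix}.
\]

This is, perhaps, easier if a little less elegant.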



CHAPTER IV 

JACOBI’S THEOREM AND ITS EXTENSIONS 
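1. Jacobi’s theorem

Theorem 16. Let $A_r$, $B_r$, ... denote the co-factors of $a_r$, $b_r$, ... in a determinant

\[ \Delta = (a_1 b_2 \dots k_n). \]

Then $\Delta' \equiv (A_1 B_2 \dots K_n) = \Delta^{n-1}$.

If we multiply by rows the two determinants

\[
\Delta = \begin{vmatrix}
a_1 & b_1 & \cdots & k_1 \\
a_2 & b_2 & \cdots & k_2 \\
\vdots & \vdots & & \vdots \\
a_n & b_n & \cdots & k_n
\end{vmatrix},
\qquad
\Delta' = \begin{vmatrix}
A_1 & B_1 & \cdots & K_1 \\
A_2 & B_2 & \cdots & K_2 \\
\vdots & \vdots & & \vdots \\
A_n & B_n & \cdots & K_n
\end{vmatrix},
\]

we obtain

\[
\Delta\Delta' = \begin{vmatrix}
\Delta & 0 & \cdots & 0 \\
0 & \Delta & \cdots & 0 \\
\vdots & \vdots & & \vdots \\
0 & 0 & \cdots & \Delta
\end{vmatrix};
\]

for $a_r A_s + b_r B_s + \dots + k_r K_s$ is equal to zero when $r \neq s$ and is equal to Δ when r = s.

Hence $\Delta\Delta' = \Delta^n$,

so that, when $\Delta \neq 0$, $\Delta' = \Delta^{n-1}$.

But when Δ is zero, Δ' is also zero. For, by Theorem 12, if Δ = 0, then $A_r B_s - A_s B_r = 0$, so that the Laplace expansion of Δ' by its first two columns is a sum of zeros.

Hence when Δ = 0, $\Delta' = 0 = \Delta^{n-1}$.

Definition. Δ' is called the adjugate determinant of Δ.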

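Theorem 17. With the notation of Theorem 16, the co-factor of $A_1$ in Δ' is equal to $a_1 \Delta^{n-2}$; and so for other letters and suffixes.

When we multiply by rows the two determinants

\[
\begin{vmatrix}
1 & 0 & \cdots & 0 \\
A_2 & B_2 & \cdots & K_2 \\
\vdots & \vdots & & \vdots \\
A_n & B_n & \cdots & K_n
\end{vmatrix},
\qquad
\begin{vmatrix}
a_1 & b_1 & \cdots & k_1 \\
a_2 & b_2 & \cdots & k_2 \\
\vdots & \vdots & & \vdots \\
a_n & b_n & \cdots & k_n
\end{vmatrix},
\]

we obtain

\[
\begin{vmatrix}
a_1 & a_2 & \cdots & a_n \\
0 & \Delta & \cdots & 0 \\
\vdots & \vdots & & \vdots \\
0 & 0 & \cdots & \Delta
\end{vmatrix},
\]

wherein all terms to the left of and below the leading diagonal are zero. Thus

\[
\Delta \begin{vmatrix}
B_2 & \cdots & K_2 \\
\vdots & & \vdots \\
B_n & \cdots & K_n
\end{vmatrix}
= a_1 \Delta^{n-1}.
\]

When $\Delta \neq 0$, this gives the result stated in the theorem. When Δ = 0, the result follows by the argument used when we considered Δ' in Theorem 16; each $B_r C_s - B_s C_r$ is zero.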
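Theorems 16 and 17 can be checked on a determinant of order four. In the sketch below the entries of `m` are illustrative assumptions and the names `cofactor` and `adjugate_array` are hypothetical; the co-factors are laid out in the same positions as the elements they belong to, as in Δ'.

```python
def det(m):
    """Determinant by expansion along the first row (small orders only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def cofactor(m, r, s):
    """Signed minor of the element in row r, column s (suffixes from 0)."""
    minor = [row[:s] + row[s + 1:] for i, row in enumerate(m) if i != r]
    return (-1) ** (r + s) * det(minor)


def adjugate_array(m):
    """Array of co-factors, each in the position of the element it belongs to."""
    n = len(m)
    return [[cofactor(m, r, s) for s in range(n)] for r in range(n)]


m = [[2, 1, 0, 1], [1, 3, -1, 0], [0, 2, 4, 1], [1, 0, 1, 2]]
n = len(m)
delta = det(m)
adj = adjugate_array(m)

theorem_16 = det(adj) == delta ** (n - 1)
theorem_17 = all(cofactor(adj, r, s) == m[r][s] * delta ** (n - 2)
                 for r in range(n) for s in range(n))

singular = [[1, 2, 3], [2, 4, 6], [0, 1, 1]]  # the case in which the determinant is zero
print(theorem_16, theorem_17, det(adjugate_array(singular)))
```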

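Moreover, if we wish to consider the minor of $J_r$ in Δ', where J is the sth letter of the alphabet, we multiply by rows

\[
\begin{vmatrix}
A_1 & \cdots & J_1 & \cdots & K_1 \\
\vdots & & \vdots & & \vdots \\
A_{r-1} & \cdots & J_{r-1} & \cdots & K_{r-1} \\
0 & \cdots & 1 & \cdots & 0 \\
A_{r+1} & \cdots & J_{r+1} & \cdots & K_{r+1} \\
\vdots & & \vdots & & \vdots \\
A_n & \cdots & J_n & \cdots & K_n
\end{vmatrix},
\qquad
\begin{vmatrix}
a_1 & \cdots & j_1 & \cdots & k_1 \\
\vdots & & \vdots & & \vdots \\
a_r & \cdots & j_r & \cdots & k_r \\
\vdots & & \vdots & & \vdots \\
a_n & \cdots & j_n & \cdots & k_n
\end{vmatrix},
\]

to obtain

\[
\begin{vmatrix}
\Delta & \cdots & 0 & \cdots & 0 \\
\vdots & & \vdots & & \vdots \\
j_1 & \cdots & j_r & \cdots & j_n \\
\vdots & & \vdots & & \vdots \\
0 & \cdots & 0 & \cdots & \Delta
\end{vmatrix}, \tag{1}
\]

where the j’s occur in the rth row and the only other non-zero terms are Δ’s in the leading diagonal. The value of (1) is $j_r \Delta^{n-1}$.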


2. General form of Jacobi's theorem 
2.1. Complementary minors. In a determinant Δ, of order n, the elements of any given r rows and r columns, where r < n, form a minor of the determinant. When these same rows and columns are excluded from Δ, the elements that are left form a determinant of order n − r, which, when the appropriate sign is prefixed, is called the complementary minor of the original minor. The sign to be prefixed is determined by the rule that a Laplace expansion shall always be of the form

\[ \sum \mu_r \gamma_{n-r}, \]

where $\gamma_{n-r}$ is the minor complementary to $\mu_r$. Thus, in

\[
\Delta = \begin{vmatrix}
a_1 & b_1 & c_1 & d_1 \\
a_2 & b_2 & c_2 & d_2 \\
a_3 & b_3 & c_3 & d_3 \\
a_4 & b_4 & c_4 & d_4
\end{vmatrix}, \tag{1}
\]

the complementary minor of

\[
\begin{vmatrix} a_1 & c_1 \\ a_3 & c_3 \end{vmatrix}
\quad\text{is}\quad
+\begin{vmatrix} b_2 & d_2 \\ b_4 & d_4 \end{vmatrix},
\]

since the Laplace expansion of (1) by its first and third columns involves the term $+(a_1 c_3)(b_2 d_4)$.

Theorem 18. With the notation of Theorem 16, let $\mathfrak{M}_r$ be a minor of Δ' having r rows and columns and let $\gamma_{n-r}$ be the complementary minor of the corresponding minor of Δ. Then

\[ \mathfrak{M}_r = \gamma_{n-r}\, \Delta^{r-1}. \tag{3} \]

In particular, if Δ is the determinant (1) above, then, in the usual notation for co-factors,

\[
\begin{vmatrix} A_1 & C_1 \\ A_3 & C_3 \end{vmatrix}
= \begin{vmatrix} b_2 & d_2 \\ b_4 & d_4 \end{vmatrix} \Delta.
\]

Let us first dispose of the easy particular cases of the theorem. If r = 1, (3) is merely the definition of a first minor; for example, $A_1$ in (1) above is, by definition, the determinant $(b_2 c_3 d_4)$. If r > 1 and Δ = 0, then also $\mathfrak{M}_r = 0$, as we see by Theorem 12. Accordingly, (3) is true whenever r = 1 and whenever Δ = 0.

It remains to prove that (3) is true when r > 1 and $\Delta \neq 0$. So let r > 1, $\Delta \neq 0$; further, let $\mu_r$ be the minor of Δ whose elements correspond in row and column to those elements of Δ' that compose $\mathfrak{M}_r$. Then, by the definition of complementary minor, due regard being paid to the sign, one Laplace expansion of Δ must contain the term $+\mu_r \gamma_{n-r}$. Hence Δ may be written in the form

\[
\begin{vmatrix}
\mu_r & \text{other elements} \\
\text{other elements} & \gamma_{n-r}
\end{vmatrix}.
\]

It is thus sufficient to prove the theorem when $\mathfrak{M}$ is formed from the first r rows and first r columns of Δ'; this we now do.

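Let A, ..., E be the first r letters, and let F, ..., K be the next n − r letters of the alphabet. When we form the product

\[
\begin{vmatrix}
A_1 & \cdots & E_1 & F_1 & \cdots & K_1 \\
\vdots & & \vdots & \vdots & & \vdots \\
A_r & \cdots & E_r & F_r & \cdots & K_r \\
0 & \cdots & 0 & 1 & \cdots & 0 \\
\vdots & & \vdots & \vdots & & \vdots \\
0 & \cdots & 0 & 0 & \cdots & 1
\end{vmatrix}
\times
\begin{vmatrix}
a_1 & \cdots & e_1 & f_1 & \cdots & k_1 \\
\vdots & & \vdots & \vdots & & \vdots \\
a_n & \cdots & e_n & f_n & \cdots & k_n
\end{vmatrix},
\]

where the last n − r elements of the leading diagonal of the first determinant are 1’s and all other elements of the last n − r rows are 0’s, we obtain

\[
\begin{vmatrix}
\Delta & \cdots & 0 & 0 & \cdots & 0 \\
\vdots & & \vdots & \vdots & & \vdots \\
0 & \cdots & \Delta & 0 & \cdots & 0 \\
f_1 & \cdots & f_r & f_{r+1} & \cdots & f_n \\
\vdots & & \vdots & \vdots & & \vdots \\
k_1 & \cdots & k_r & k_{r+1} & \cdots & k_n
\end{vmatrix}.
\]

That is to say,

\[ \mathfrak{M}_r \times \Delta = \Delta^r \gamma_{n-r}, \]

and so (3) is true whenever r > 1 and $\Delta \neq 0$.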
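Theorem 18 can be verified for every choice of rows and columns in a determinant of order four. The sketch below is an illustration under assumed data: the entries of `m` and all helper names are hypothetical, suffixes run from 0, and the sign attached to the complementary minor is $(-1)$ raised to the sum of the chosen row and column numbers, which is the sign a Laplace expansion requires.

```python
from itertools import combinations


def det(m):
    """Determinant by expansion along the first row (small orders only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def cofactor(m, r, s):
    """Signed minor of the element in row r, column s (suffixes from 0)."""
    minor = [row[:s] + row[s + 1:] for i, row in enumerate(m) if i != r]
    return (-1) ** (r + s) * det(minor)


def sub(m, rows, cols):
    """Minor array formed from the given rows and columns of m."""
    return [[m[i][j] for j in cols] for i in rows]


def check_theorem_18(m, r):
    """Compare each r-rowed minor of the adjugate with gamma_{n-r} * Delta^(r-1)."""
    n = len(m)
    delta = det(m)
    adj = [[cofactor(m, i, j) for j in range(n)] for i in range(n)]
    for rows in combinations(range(n), r):
        for cols in combinations(range(n), r):
            other_rows = [i for i in range(n) if i not in rows]
            other_cols = [j for j in range(n) if j not in cols]
            sign = (-1) ** (sum(rows) + sum(cols))
            gamma = sign * det(sub(m, other_rows, other_cols))
            if det(sub(adj, rows, cols)) != gamma * delta ** (r - 1):
                return False
    return True


m = [[2, 1, 0, 1], [1, 3, -1, 0], [0, 2, 4, 1], [1, 0, 1, 2]]
print(check_theorem_18(m, 2), check_theorem_18(m, 3))
```

The particular case displayed after the theorem corresponds to choosing the first and third rows and the first and third columns.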



CHAPTER V 


SYMMETRICAL AND SKEW-SYMMETRICAL 
DETERMINANTS 

1. The determinant $|a_{rs}|$ is said to be symmetrical if $a_{rs} = a_{sr}$ for every r and s.

The determinant $|a_{rs}|$ is said to be skew-symmetrical if $a_{rs} = -a_{sr}$ for every r and s. It is an immediate consequence of the definition that the elements in the leading diagonal of such a determinant must all be zero, for $a_{rr} = -a_{rr}$.

2. Theorem 19. A skew-symmetrical determinant of odd order has the value zero.

The value of a determinant is unaltered when rows and columns are interchanged. In a skew-symmetrical determinant such an interchange is equivalent to multiplying each row of the original determinant by −1, that is, to multiplying the whole determinant by $(-1)^n$. Hence the value of a skew-symmetrical determinant of order n is unaltered when it is multiplied by $(-1)^n$; and if n is odd, this value must be zero.
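A quick numerical illustration of Theorem 19 follows. The entries are assumptions chosen for the example and `skew` is a hypothetical helper that builds a skew-symmetrical array from the elements above the leading diagonal.

```python
def det(m):
    """Determinant by expansion along the first row (small orders only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def skew(upper, n):
    """Skew-symmetrical array of order n built from the entries above the diagonal."""
    m = [[0] * n for _ in range(n)]
    values = iter(upper)
    for r in range(n):
        for s in range(r + 1, n):
            m[r][s] = next(values)
            m[s][r] = -m[r][s]
    return m


m3 = skew([2, -5, 7], 3)
m5 = skew([1, 2, 3, 4, 5, 6, 7, 8, 9, 10], 5)
print(det(m3), det(m5))
```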

3. Before considering determinants of even order we shall 
examine the first minors of a skew-symmetrical determinant of 
odd order. 

Let $A_{rs}$ denote the co-factor of $a_{rs}$, the element in the rth row and sth column of $|a_{rs}|$, a skew-symmetrical determinant of odd order; then $A_{rs}$ is $(-1)^{n-1} A_{sr}$, that is, $A_{rs} = A_{sr}$. For $A_{rs}$ and $A_{sr}$ differ only in considering column for row and row for column in Δ, and, as we have seen in § 2, a column of Δ is (−1) times the corresponding row; moreover, in forming $A_{rs}$ we take the elements of n − 1 rows of Δ.

4. We require yet another preliminary result: this time one 
that is often useful in other connexions. 
Theorem 20. The determinant

\[
\Delta \equiv \begin{vmatrix}
a_{11} & a_{12} & \cdots & a_{1n} & X_1 \\
a_{21} & a_{22} & \cdots & a_{2n} & X_2 \\
\vdots & \vdots & & \vdots & \vdots \\
a_{n1} & a_{n2} & \cdots & a_{nn} & X_n \\
Y_1 & Y_2 & \cdots & Y_n & S
\end{vmatrix} \tag{1}
\]

\[ = |a_{rs}|\, S - \sum_{r=1}^{n} \sum_{s=1}^{n} A_{rs} X_r Y_s, \tag{2} \]

where $A_{rs}$ is the co-factor of $a_{rs}$ in $|a_{rs}|$.

The one term of the expansion that involves no X and no Y is $|a_{rs}| S$. The term involving $X_r Y_s$ is obtained by putting S, all X other than $X_r$, and all Y other than $Y_s$ to be zero. Doing this, we see that the coefficient of $X_r Y_s$ is $\pm A_{rs}$. But, considering the Laplace expansion of (1) by its rth and (n+1)th rows, we see that the coefficients of $X_r Y_s$ and of $a_{rs} S$ are of opposite sign. Hence the coefficient of $X_r Y_s$ is $-A_{rs}$. Moreover, when Δ is expanded as in Theorem 1 (i) every term must involve either S or a product $X_r Y_s$. Hence (2) accounts for all the terms in the expansion of Δ.
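The expansion (2) of the bordered determinant can be confirmed on an example. The sketch assumes illustrative values for the $a_{rs}$, the X’s, the Y’s and S; the helper names are hypothetical.

```python
def det(m):
    """Determinant by expansion along the first row (small orders only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def cofactor(m, r, s):
    """Signed minor of the element in row r, column s (suffixes from 0)."""
    minor = [row[:s] + row[s + 1:] for i, row in enumerate(m) if i != r]
    return (-1) ** (r + s) * det(minor)


a = [[2, 1, 0], [1, 3, -1], [0, 2, 4]]
X = [1, -2, 3]
Y = [4, 0, -1]
S = 5
n = len(a)

# Border the array with the column of X's, the row of Y's and the corner element S.
bordered = [a[r] + [X[r]] for r in range(n)] + [Y + [S]]

lhs = det(bordered)
rhs = det(a) * S - sum(cofactor(a, r, s) * X[r] * Y[s]
                       for r in range(n) for s in range(n))
print(lhs, rhs)
```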

5. Theorem 21. A skew-symmetrical determinant of order 2n is the square of a polynomial function of its elements.

In Theorem 20, let $|a_{rs}|$ be a skew-symmetrical determinant of order 2n − 1; as such, its value is zero (Theorem 19). Further, let $Y_r = -X_r$, S = 0, so that the determinant (1) of Theorem 20 is now a skew-symmetrical determinant of order 2n. Its value is

\[ \sum \sum A_{rs} X_r X_s. \tag{3} \]

Since $|a_{rs}| = 0$, $A_{rs} A_{sr} = A_{rr} A_{ss}$ (Theorem 12), or $A_{rs}^2 = A_{rr} A_{ss}$, and $A_{r1}/A_{11} = A_{rs}/A_{1s}$, or $A_{rs} = A_{1r} A_{1s}/A_{11}$. Hence (3) is

\[ \bigl( X_1 \sqrt{A_{11}} \pm X_2 \sqrt{A_{22}} \pm \dots \pm X_n \sqrt{A_{nn}} \bigr)^2, \tag{4} \]

where the sign preceding $X_r$ is $(-1)^p$, chosen so that

\[ A_{1r} = (-1)^p \sqrt{A_{11} A_{rr}}. \]

Now $A_{11}, \dots, A_{nn}$ are themselves skew-symmetrical determinants of order 2n − 2, and if we suppose that each is the square of a polynomial function of its elements, of degree n − 1, then (4) will be the square of a polynomial function of degree n. Hence our theorem is true for a determinant of order 2n if it is true for one of order 2n − 2.

But

\[ \begin{vmatrix} 0 & a_{12} \\ -a_{12} & 0 \end{vmatrix} = a_{12}^2, \]

and is a perfect square. Hence our theorem is true when n = 1.

It follows by induction that the theorem is true for all integer values of n.

6. The Pfaffian 

A polynomial whose square is equal to a skew-symmetrical 
determinant of even order is called a Pfaffian. Its properties 
and relations to determinants have been widely studied. 
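For order four the Pfaffian can be written down explicitly. The sketch below uses the standard expression $a_{12}a_{34} - a_{13}a_{24} + a_{14}a_{23}$, which is not stated in the text and is brought in here as an assumption for illustration; the numerical entries and function names are likewise hypothetical.

```python
def det(m):
    """Determinant by expansion along the first row (small orders only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def skew4(a12, a13, a14, a23, a24, a34):
    """Skew-symmetrical array of order four with the given elements above the diagonal."""
    return [[0, a12, a13, a14],
            [-a12, 0, a23, a24],
            [-a13, -a23, 0, a34],
            [-a14, -a24, -a34, 0]]


def pfaffian4(a12, a13, a14, a23, a24, a34):
    """Standard order-four Pfaffian (assumed formula, not quoted from the text)."""
    return a12 * a34 - a13 * a24 + a14 * a23


entries = (2, -1, 3, 4, 5, -2)
print(det(skew4(*entries)), pfaffian4(*entries) ** 2)
```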

Here we give merely sufficient references for the reader to 
study the subject if he so desires. The study of Pfaffians, 
interesting though it may be, is too special in its appeal to 
warrant its inclusion in this book. 

G. Salmon, Lessons Introductory to the Modern Higher Algebra (Dublin, 
1885), Lesson V. 

S. Barnard and J. M. Child, Higher Algebra (London, 1936), chapter ix, 
§ 22 . 

Sir T. Muir, Contributions to the History of Determinants , vols. i-iv 
(London, 1890-1930). This history covers every topic of deter- 
minant theory; it is delightfully written and can be recommended 
as a reference to anyone who is seriously interested. It is too 
detailed and exacting for the beginner. 


Examples VI (Miscellaneous) 


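1. Prove that

\[
\begin{vmatrix}
a_{11} & a_{12} & \cdots & a_{1n} & X_1 \\
a_{21} & a_{22} & \cdots & a_{2n} & X_2 \\
\vdots & \vdots & & \vdots & \vdots \\
a_{n1} & a_{n2} & \cdots & a_{nn} & X_n \\
X_1 & X_2 & \cdots & X_n & 0
\end{vmatrix}
= -\{A_{11} X_1^2 + \dots + (A_{rs} + A_{sr}) X_r X_s + \dots\},
\]

where $A_{rs}$ is the co-factor of $a_{rs}$ in $|a_{rs}|$.

If $a_{rs} = a_{sr}$ and $|a_{rs}| = 0$, prove that the above quadratic form may be written as

\[ -\{A_{11} X_1 + \dots + A_{1n} X_n\}^2 / A_{11}. \]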
that


n » 

= 2 2>r. 

r- 1 i -1 

J = 



2X U 

= dS/dx u , a TB = 

a *r» 

°ll 

°12 

a ifc 


a ai 

°n 

. . . 


°fci 

°fca 


X k 



• • • ** 

s 


**• 




and k < n t prove 


k 

i\o. 

J . 


Hint. Use Example 1 , consider dJ/dx u , and use Theorem 

3 . Aj — (^{^3^4), A 2 — (a 3 64 Ci), Ag = (o 4 & 1 c 8 ) t A 4 = (0|6kc 3 ), and 
A- is the co-factor of a T in A 4 . Prove that A\ = —A}, that J( 1 J = A}, 
and that 

= A, A,. 


A\ 

A\ 

A\ 

= 0 , 

A i 

A\ 

A\ 

m 


B\ 


Bi 

B\ 

B\ 

ci 

ci 

Cl 


Cl 

Cl 

ci 


Hint. B\C\ — B\ C\ — — a 4 A 3 (by Theorem 17 ): use Theorem 10 . 


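4. Δ is a determinant of order n, $a_{rs}$ a typical element of Δ, $A_{rs}$ the co-factor of $a_{rs}$ in Δ, and $\Delta \neq 0$. Prove that

\[
\begin{vmatrix}
A_{11} + \Delta/x & A_{12} & \cdots & A_{1n} \\
A_{21} & A_{22} + \Delta/x & \cdots & A_{2n} \\
\vdots & \vdots & & \vdots \\
A_{n1} & A_{n2} & \cdots & A_{nn} + \Delta/x
\end{vmatrix} = 0
\]

whenever

\[
\begin{vmatrix}
a_{11} + x & a_{12} & \cdots & a_{1n} \\
a_{21} & a_{22} + x & \cdots & a_{2n} \\
\vdots & \vdots & & \vdots \\
a_{n1} & a_{n2} & \cdots & a_{nn} + x
\end{vmatrix} = 0.
\]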



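5. Prove that

\[
\begin{vmatrix}
1 & \sin\alpha & \cos\alpha & \sin 2\alpha & \cos 2\alpha \\
1 & \sin\beta & \cos\beta & \sin 2\beta & \cos 2\beta \\
1 & \sin\gamma & \cos\gamma & \sin 2\gamma & \cos 2\gamma \\
1 & \sin\delta & \cos\delta & \sin 2\delta & \cos 2\delta \\
1 & \sin\epsilon & \cos\epsilon & \sin 2\epsilon & \cos 2\epsilon
\end{vmatrix}
= 256 \prod \sin \tfrac{1}{2}(\alpha - \beta),
\]

with a proper arrangement of the signs of the differences.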

6. Prove that if $\Delta = |a_{rs}|$ and the summation convention be used (Greek letters only being used as dummies), then the results of Theorem 10 may be expressed as

\[ a_{\alpha s} A_{\alpha r} = 0 = a_{r\alpha} A_{s\alpha} \quad\text{when } r \neq s, \]

\[ a_{\alpha s} A_{\alpha r} = \Delta = a_{r\alpha} A_{s\alpha} \quad\text{when } r = s. \]

Prove also that if n variables x are related to n variables X by the transformation

\[ X_r = a_{r\alpha} x_\alpha, \]

then $\Delta x_s = A_{\alpha s} X_\alpha$.

7. Prove Theorem 9 by forming the product of the given circulant 
by the determinant (aij, aft), where w n are distinct nth roots 
of unity. 

8. Prove that, if $f_r(a) = g_r(a) = h_r(a)$ when r = 1, 2, 3, then the determinant

\[
\begin{vmatrix}
f_1(x) & g_1(x) & h_1(x) \\
f_2(x) & g_2(x) & h_2(x) \\
f_3(x) & g_3(x) & h_3(x)
\end{vmatrix},
\]

whose elements are polynomials in x, contains $(x - a)^2$ as a factor.

Hint. Consider a determinant with $c_1' = c_1 - c_2$, $c_2' = c_2 - c_3$ and use the remainder theorem.

9. If the elements of a determinant are polynomials in x, and if r columns (rows) become equal when x = a, then the determinant has $(x - a)^{r-1}$ as a factor.

10. Verify Theorem 9 when n = 5, $a_1 = a_4 = a_5 = 1$, $a_2 = a$, and $a_3 = a^2$ by independent proofs that both determinant and product are equal to

\[ (a^2 + a + 3)(a - 1)^4 (a^4 + 3a^3 + 4a^2 + 2a + 1). \]




PART II 
MATRICES 




CHAPTER VI 

DEFINITIONS AND ELEMENTARY PROPERTIES 


1. Linear substitutions 

We can think of n numbers, real or complex, either as separate 
entities x v x 2 ,..., x n or as a single entity x that can be broken 
up into its n separate pieces (or components) if we wish to do 
so. For example, in ordinary three-dimensional space, given 
a set of axes through 0 and a point P determined by its 
coordinates with respect to those axes, we can think of the 
vector OP (or of the point P) and denote it by x } or we can 
give prominence to the components x v x 2i x 3 of the vector along 
the axes and write x as (x v x 2 , x 3 ). 

When we are thinking of £ as a single entity we shall refer to 
it as a 'number . This is merely - a slight extension of a common 
practice in dealing with a complex number z, which is, in fact, 
a pair of real numbers x, 2 /.f 

Now suppose that two 'numbers' x , with components x v x 2 ,..., 
x n , and X, with components X v X 2 ,..., X n , are connected by 
a set of n equations 

X r = a rl x 1 +a r2 x 2 +...+a rn x n (r = l,...,n), (1) 


wherein the a r3 are given constants. 

[For example, consider a change of axes in coordinate geo- 
metry where the x r and X r are the components of the same 
vector referred to different sets of axes-] 

Introduce the notation A for the set of $n^2$ numbers, real or
complex,

\[ \begin{matrix} a_{11} & a_{12} & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & a_{2n} \\ \vdots & \vdots & & \vdots \\ a_{n1} & a_{n2} & \dots & a_{nn} \end{matrix} \]

and understand by the notation Ax a ‘number’ whose components
are given by the expressions on the right-hand side of equations
(1). Then the equations (1) can be written symbolically as

X = Ax. (2) 


† Cf. G. H. Hardy, Pure Mathematics, chap. iii.

The reader will find that, with but little practice, the symbolical
equation (2) will be all that it is necessary to write when
equations such as (1) are under consideration.
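
The passage from the n equations (1) to the single symbol Ax can be illustrated by a short Python sketch. The function name `substitute`, the representation of A as a list of rows, and the numerical values are all assumptions made for the illustration; none of them comes from the text.

```python
def substitute(A, x):
    """Return the components X_r = a_r1*x_1 + ... + a_rn*x_n of X = Ax."""
    return [sum(a_rs * x_s for a_rs, x_s in zip(row, x)) for row in A]


# Illustrative values (hypothetical): n = 2.
A = [[1, 2],
     [3, 4]]
x = [5, 6]

X = substitute(A, x)  # X_1 = 1*5 + 2*6, X_2 = 3*5 + 4*6
print(X)
```

Each row of A supplies the coefficients of one of the equations (1), so that the whole set of equations is carried by the one call `substitute(A, x)`.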


2. Linear substitutions as a guide to matrix addition 

It is a familiar fact of vector geometry that the sum of a
vector X, with components $X_1, X_2, X_3$, and a vector Y, with
components $Y_1, Y_2, Y_3$, is a vector Z, or X + Y, with components

\[ X_1 + Y_1, \quad X_2 + Y_2, \quad X_3 + Y_3. \]


Consider now a ‘number’ x and a related ‘number’ X given by

\[ X = Ax, \tag{2} \]

where A has the same meaning as in § 1. Take, further, a second
‘number’ Y given by

\[ Y = Bx, \tag{3} \]

where B symbolizes the set of $n^2$ numbers

\[ \begin{matrix} b_{11} & b_{12} & \dots & b_{1n} \\ \vdots & \vdots & & \vdots \\ b_{n1} & b_{n2} & \dots & b_{nn} \end{matrix} \]

and (3) is the symbolic form of the n equations

\[ Y_r = b_{r1}x_1 + b_{r2}x_2 + \dots + b_{rn}x_n \quad (r = 1, 2, \dots, n). \]

We naturally, by comparison with vectors in a plane or in
space, denote by X + Y the ‘number’ with components $X_r + Y_r$.
But

\[ X_r + Y_r = (a_{r1}+b_{r1})x_1 + (a_{r2}+b_{r2})x_2 + \dots + (a_{rn}+b_{rn})x_n, \]

and so we can write

\[ X + Y = (A+B)x \]

provided that we interpret A + B to be the set of $n^2$ numbers

\[ \begin{matrix} a_{11}+b_{11} & a_{12}+b_{12} & \dots & a_{1n}+b_{1n} \\ \vdots & \vdots & & \vdots \\ a_{n1}+b_{n1} & a_{n2}+b_{n2} & \dots & a_{nn}+b_{nn} \end{matrix} \]

We are thus led to the idea of defining the sum of two symbols
such as A and B. This idea we consider with more precision
in the next section.

3. Matrices

3.1. The sets symbolized by A and B in the previous section 
have as many rows as columns. In the definition that follows 
we do not impose this restriction, but admit sets wherein the 
number of rows may differ from the number of columns. 

Definition 1. An array of mn numbers, real or complex, the
array having m rows and n columns,

\[ \begin{bmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & a_{2n} \\ \vdots & \vdots & & \vdots \\ a_{m1} & a_{m2} & \dots & a_{mn} \end{bmatrix}, \]

is called a matrix. When m = n the array is called a square
matrix of order n.

In writing, a matrix is frequently denoted by a single letter
A, or a, or by any other symbol one cares to choose. For
example, a common notation for the matrix of the definition
is $[a_{rs}]$. The square bracket is merely a conventional symbol
(to mark the fact that we are not considering a determinant)
and is conveniently read as ‘the matrix’.

As we have seen, the idea of matrices comes from linear 
substitutions, such as those considered in § 2. But there is one 
important difference between the A of §2 and the A that 
denotes a matrix. In substitutions, A is thought of as operating 
on some ‘number’ x. The definition of a matrix deliberately
omits this notion of A in relation to something else, and so 
leaves matrix notation open to a wider interpretation. 

3.2. We now introduce a number of definitions that will
make precise the meaning to be attached to such symbols as
A + B, A − B, 2A, when A and B denote matrices.

Definition 2. Two matrices A, B are conformable for 
addition when each has the same number of rows and each has 
the same number of columns . 

Definition 3. Addition. The sum of a matrix A, with an
element $a_{rs}$ in its r-th row and s-th column, and a matrix B, with
an element $b_{rs}$ in its r-th row and s-th column, is defined only when
A, B are conformable for addition and is then defined as the matrix
having $a_{rs} + b_{rs}$ as the element in its r-th row and s-th column.

The sum is denoted by A + B.


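The matrices

\[ \begin{bmatrix} a_1 \\ a_2 \end{bmatrix}, \qquad \begin{bmatrix} a_1 & 0 \\ a_2 & 0 \end{bmatrix} \]

are distinct; the first cannot be added to a square matrix of
order 2, but the second can; e.g.

\[ \begin{bmatrix} a_1 & 0 \\ a_2 & 0 \end{bmatrix} + \begin{bmatrix} b_1 & c_1 \\ b_2 & c_2 \end{bmatrix} = \begin{bmatrix} a_1+b_1 & c_1 \\ a_2+b_2 & c_2 \end{bmatrix}. \]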

Definition 4. The matrix $-A$ is that matrix whose elements
are those of A multiplied by $-1$.

Definition 5. Subtraction. $A - B$ is defined as $A + (-B)$.
It is called the difference of the two matrices.

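For example, the matrix $-A$ where A is the one-rowed
matrix $[a, b]$ is, by definition, the matrix $[-a, -b]$. Again, by
Definition 5,

\[ \begin{bmatrix} a_1 & b_1 \\ a_2 & b_2 \end{bmatrix} - \begin{bmatrix} c_1 & d_1 \\ c_2 & d_2 \end{bmatrix} = \begin{bmatrix} a_1 & b_1 \\ a_2 & b_2 \end{bmatrix} + \begin{bmatrix} -c_1 & -d_1 \\ -c_2 & -d_2 \end{bmatrix}, \]

and this, by Definition 3, is equal to

\[ \begin{bmatrix} a_1-c_1 & b_1-d_1 \\ a_2-c_2 & b_2-d_2 \end{bmatrix}. \]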

Definition 6. A matrix having every element zero is called
a null matrix, and is written 0.

Definition 7. The matrices A and B are said to be equal , and 
we write A = B, when the two matrices are conformable for addi- 
tion and each element of A is equal to the corresponding element 

of B. 

It follows from Definitions 6 and 7 that ‘A = B’ and
‘A − B = 0’ mean the same thing, namely, each $a_{rs}$ is equal to
the corresponding $b_{rs}$.

Definition 8. Multiplication by a number. When r is
a number, real or complex, and A is a matrix, rA is defined to
be the matrix each element of which is r times the corresponding
element of A.

For example, if

\[ A = \begin{bmatrix} a_1 & b_1 \\ c_1 & d_1 \end{bmatrix}, \quad \text{then} \quad 3A = \begin{bmatrix} 3a_1 & 3b_1 \\ 3c_1 & 3d_1 \end{bmatrix}. \]


In virtue of Definitions 3, 5, and 8, we are justified in writing

2A instead of A + A,

3A instead of 5A − 2A,

and so on. By an easy extension, which we shall leave to the
reader, we can consider the sum of several matrices and write,
for example, (4+i)A instead of A + A + A + (1+i)A.
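
Definitions 2 to 8 can be sketched in Python as follows. The representation of a matrix as a list of rows and the function names `shape`, `add`, `scale`, `neg`, and `sub` are assumptions of this sketch rather than anything fixed by the text.

```python
def shape(A):
    """Return (number of rows, number of columns) of a matrix given as a list of rows."""
    return len(A), len(A[0])


def add(A, B):
    """Sum A + B (Definition 3); defined only for conformable matrices (Definition 2)."""
    if shape(A) != shape(B):
        raise ValueError("matrices are not conformable for addition")
    return [[a + b for a, b in zip(row_a, row_b)] for row_a, row_b in zip(A, B)]


def scale(r, A):
    """Product rA of a matrix by a number r, real or complex (Definition 8)."""
    return [[r * a for a in row] for row in A]


def neg(A):
    """The matrix -A (Definition 4)."""
    return scale(-1, A)


def sub(A, B):
    """Difference A - B, defined as A + (-B) (Definition 5)."""
    return add(A, neg(B))


A = [[1, 2], [3, 4]]          # illustrative values only
column = [[1], [2]]           # two rows, one column: cannot be added to A
```

With these definitions `add(A, A)` agrees with `scale(2, A)`, and `sub(scale(5, A), scale(2, A))` agrees with `scale(3, A)`, which is the justification for the abbreviations 2A and 3A above.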

3.3. Further, since the addition and subtraction of matrices 
is based directly on the addition and subtraction of their 
elements, which are real or complex numbers, the laws that 
govern addition in ordinary algebra also govern the addition 
of matrices. The laws that govern addition and subtraction in
algebra are: (i) the associative law, of which an example is

\[ (a+b)+c = a+(b+c), \]

either sum being denoted by a+b+c;

(ii) the commutative law, of which an example is

\[ a+b = b+a; \]

(iii) the distributive law, of which examples are

\[ r(a+b) = ra+rb, \qquad -(a-b) = -a+b. \]

The matrix equation (A + B) + C = A + (B + C) is an immediate
consequence of (a+b)+c = a+(b+c), where the small
letters denote the elements standing in, say, the rth row and
sth column of the matrices A, B, C; and either sum is denoted
by A + B + C. Similarly, the matrix equation A + B = B + A
is an immediate consequence of a+b = b+a. Thus the associative
and commutative laws are satisfied for addition and
subtraction, and we may correctly write

\[ A + B + (C + D) = A + B + C + D = (B + C) + (A + D); \]

and so on. 

On the other hand, although we may write 

\[ r(A+B) = rA + rB \]

when r is a number, real or complex, on the ground that

\[ r(a+b) = ra + rb \]

when r, a, b are numbers, real or complex, we have not yet
assigned meanings to symbols such as R(A + B), RA, RB when
R, A, B are all matrices. This we shall do in the sections that
follow.

4. Linear substitutions as a guide to matrix multiplication

Consider the equations

\[ \begin{aligned} x_1 &= a_{11}y_1 + a_{12}y_2, & y_1 &= b_{11}z_1 + b_{12}z_2, \\ x_2 &= a_{21}y_1 + a_{22}y_2, & y_2 &= b_{21}z_1 + b_{22}z_2, \end{aligned} \tag{1} \]

where the a and b are given constants. They enable us to
express $x_1, x_2$ in terms of the a, b, and $z_1, z_2$; in fact,

\[ \begin{aligned} x_1 &= (a_{11}b_{11} + a_{12}b_{21})z_1 + (a_{11}b_{12} + a_{12}b_{22})z_2, \\ x_2 &= (a_{21}b_{11} + a_{22}b_{21})z_1 + (a_{21}b_{12} + a_{22}b_{22})z_2. \end{aligned} \tag{2} \]

If we introduce the notation of § 1, we may write the equations (1) as

\[ x = Ay, \qquad y = Bz. \tag{1a} \]

We can go direct from x to z and write 

x = ABz , (2 a) 

provided we interpret AB in such a way that (2a) is the symbolic
form of equations (2); that is, we must interpret AB to
be the matrix

\[ \begin{bmatrix} a_{11}b_{11} + a_{12}b_{21} & a_{11}b_{12} + a_{12}b_{22} \\ a_{21}b_{11} + a_{22}b_{21} & a_{21}b_{12} + a_{22}b_{22} \end{bmatrix}. \tag{3} \]

This gives us a direct lead to the formal definition of the
product AB of two matrices A and B. But before we give this
formal definition it will be convenient to define the ‘scalar product’
of two ‘numbers’.


5. The scalar product or inner product of two numbers

Definition 9. Let x, y be two ‘numbers’, in the sense of § 1,
having components $x_1, \dots, x_n$ and $y_1, \dots, y_n$. Then

\[ x_1y_1 + x_2y_2 + \dots + x_ny_n \]

is called the inner product or the scalar product of the two
numbers. If the two numbers have m, n components respectively
and m ≠ n, the inner product is not defined.

In using this definition for descriptive purposes, we shall
permit ourselves a certain degree of freedom in what we call
a ‘number’. Thus, in dealing with two matrices $[a_{rs}]$ and $[b_{rs}]$,
we shall speak of

\[ a_{11}b_{12} + a_{12}b_{22} + a_{13}b_{32} + \dots + a_{1n}b_{n2} \tag{1} \]

as the inner product of the first row of $[a_{rs}]$ by the second
column of $[b_{rs}]$. That is to say, we think of the first row of
$[a_{rs}]$ as a ‘number’ having components $(a_{11}, a_{12}, \dots, a_{1n})$ and of
the second column of $[b_{rs}]$ as a ‘number’ having components
$(b_{12}, b_{22}, \dots, b_{n2})$.

6. Matrix multiplication 

6.1. Definition 10. Two matrices A, B are conformable
for the product AB when the number of columns in A is
equal to the number of rows in B.

Definition 11. product. The product AB is defined only 
when the matrices A , B are conformable for this product: it is 
then defined as the matrix whose element in the i-th row and k-th 
column is the inner product of the i-th row of A by the k-th column 
of B. 

It is an immediate consequence† of the definition that AB
has as many rows as A and as many columns as B.

The best way of seeing the necessity of having the matrices
conformable is to try the process of multiplication on two
non-conformable matrices. Thus, if

\[ A = \begin{bmatrix} a_1 \\ a_2 \end{bmatrix}, \qquad B = \begin{bmatrix} b_1 & c_1 \\ b_2 & c_2 \end{bmatrix}, \]

AB cannot be defined. For a row of A consists of one letter
and a column of B consists of two letters, so that we cannot
form the inner products required by the definition. On the
other hand,

\[ BA = \begin{bmatrix} b_1 & c_1 \\ b_2 & c_2 \end{bmatrix} \times \begin{bmatrix} a_1 \\ a_2 \end{bmatrix} = \begin{bmatrix} b_1a_1 + c_1a_2 \\ b_2a_1 + c_2a_2 \end{bmatrix}, \]

a matrix having 2 rows and 1 column.

† The reader is recommended to work out a few products for himself. One
or two examples follow in §§ 6.1, 6.3.
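
Definitions 9 to 11 may be sketched in Python as below. The function names `inner` and `matmul`, the list-of-rows representation, and the numerical entries are assumptions made for the illustration; the example mirrors the case just discussed, in which BA can be formed but AB cannot.

```python
def inner(x, y):
    """Inner product x_1*y_1 + ... + x_n*y_n (Definition 9)."""
    if len(x) != len(y):
        raise ValueError("inner product not defined: unequal numbers of components")
    return sum(a * b for a, b in zip(x, y))


def matmul(A, B):
    """Product AB (Definition 11): element (i, k) is row i of A by column k of B."""
    if len(A[0]) != len(B):
        raise ValueError("matrices are not conformable for the product AB")
    columns_of_B = [list(col) for col in zip(*B)]
    return [[inner(row, col) for col in columns_of_B] for row in A]


A = [[1], [2]]          # 2 rows, 1 column (hypothetical entries)
B = [[3, 4], [5, 6]]    # 2 rows, 2 columns

BA = matmul(B, A)       # conformable: B has 2 columns and A has 2 rows

try:
    matmul(A, B)        # A has 1 column but B has 2 rows
    AB_defined = True
except ValueError:
    AB_defined = False
```

The product `BA` has as many rows as B and as many columns as A, in agreement with the remark that follows Definition 11.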

6.2. Theorem 22. The matrix AB is, in general, distinct from
the matrix BA.

As we have just seen, we can form both AB and BA only
if the number of columns of A is equal to the number of rows
of B and the number of columns of B is equal to the number of
rows of A. When these products are formed the elements in
the ith row and kth column are, respectively,

in AB, the inner product of the ith row of A by the kth
column of B;

in BA, the inner product of the ith row of B by the kth
column of A.

Consequently, the matrix AB is not, in general, the same
matrix as BA.
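
A small numerical instance of Theorem 22 is sketched below in Python. The two matrices are arbitrary illustrative choices, and `matmul` is a hypothetical helper written for this sketch.

```python
def matmul(A, B):
    """Product AB: element (i, k) is the inner product of row i of A by column k of B."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


A = [[1, 2],
     [3, 4]]
B = [[0, 1],
     [1, 0]]

AB = matmul(A, B)   # post-multiplying A by B interchanges the columns of A
BA = matmul(B, A)   # pre-multiplying A by B interchanges the rows of A
print(AB, BA)
```

Here both products exist and have the same number of rows and columns, yet they differ element by element.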


6.3. Pre-multiplication and post-multiplication. Since
AB and BA are usually distinct, there can be no precision in
the phrase ‘multiply A by B’ until it is clear whether it shall
mean AB or BA. Accordingly, we introduce the terms† post-multiplication
and pre-multiplication (and thereafter avoid the
use of such lengthy words as much as we can!). The matrix A
post-multiplied by B is the matrix AB; the matrix A pre-multiplied
by B is the matrix BA.

Some simple examples of multiplication are

\[ \begin{bmatrix} a & b \\ c & d \end{bmatrix} \times \begin{bmatrix} \alpha & \gamma \\ \beta & \delta \end{bmatrix} = \begin{bmatrix} a\alpha + b\beta & a\gamma + b\delta \\ c\alpha + d\beta & c\gamma + d\delta \end{bmatrix}, \]

\[ \begin{bmatrix} a & h & g \\ h & b & f \\ g & f & c \end{bmatrix} \times \begin{bmatrix} x \\ y \\ z \end{bmatrix} = \begin{bmatrix} ax + hy + gz \\ hx + by + fz \\ gx + fy + cz \end{bmatrix}. \]

† Sometimes one uses the terms fore and aft instead of pre and post.

6.4. The distributive law for multiplication. One of the
laws that govern the product of complex numbers is the distributive
law, of which an illustration is a(b+c) = ab+ac. This
law also governs the algebra of matrices.

For convenience of setting out the work we shall consider
only square matrices of a given order n, and we shall use A, B, ...
to denote $[a_{ik}]$, $[b_{ik}]$, ... .

We recall that the notation $[a_{ik}]$ sets down the element that
is in the ith row and the kth column of the matrix so denoted.
Accordingly, by Definition 11,

\[ AB = \Bigl[\sum_{j=1}^{n} a_{ij}b_{jk}\Bigr]. \]

In this notation we may write

\[ \begin{aligned} A(B+C) &= [a_{ik}] \times ([b_{ik}] + [c_{ik}]) \\ &= [a_{ik}] \times [b_{ik} + c_{ik}] && \text{(Definition 3)} \\ &= \Bigl[\sum_{j=1}^{n} a_{ij}(b_{jk} + c_{jk})\Bigr] && \text{(Definition 11)} \\ &= \Bigl[\sum_{j=1}^{n} a_{ij}b_{jk}\Bigr] + \Bigl[\sum_{j=1}^{n} a_{ij}c_{jk}\Bigr] && \text{(Definition 3)} \\ &= AB + AC && \text{(Definition 11).} \end{aligned} \]

Similarly, we may prove that

\[ (A+B)C = AC + BC. \]

6.5. The associative law for multiplication. Another law
that governs the product of complex numbers is the associative
law, of which an illustration is (ab)c = a(bc), either being commonly
denoted by the symbol abc.

We shall now consider the corresponding relations between
three square matrices A, B, C, each of order n. We shall show
that

\[ (AB)C = A(BC) \]

and thereafter we shall use the symbol ABC to denote either
of them.

Let $B = [b_{ik}]$, $C = [c_{ik}]$ and let $BC = [\gamma_{ik}]$, where, by
Definition 11,

\[ \gamma_{ik} = \sum_{j=1}^{n} b_{ij}c_{jk}. \]

Then A(BC) is a matrix whose element in the ith row and
kth column is

\[ \sum_{l=1}^{n} a_{il}\gamma_{lk} = \sum_{l=1}^{n} a_{il} \sum_{j=1}^{n} b_{lj}c_{jk} = \sum_{l=1}^{n}\sum_{j=1}^{n} a_{il}b_{lj}c_{jk}. \tag{1} \]

Similarly, let $AB = [\beta_{ik}]$, where, by Definition 11,

\[ \beta_{ik} = \sum_{j=1}^{n} a_{ij}b_{jk}. \]

Then (AB)C is a matrix whose element in the ith row and
kth column is

\[ \sum_{l=1}^{n} \beta_{il}c_{lk} = \sum_{l=1}^{n}\sum_{j=1}^{n} a_{ij}b_{jl}c_{lk}. \tag{2} \]

But the expression (2) gives the same terms as (1), though
in a different order of arrangement; for example, the term
corresponding to l = 2, j = 3 in (1) is the same as the term
corresponding to l = 3, j = 2 in (2).

Hence A(BC) = (AB)C and we may use the symbol ABC
to denote either.

We use $A^2, A^3, \dots$ to denote AA, AAA, etc.
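
The double sum (1) can be compared directly with the two bracketings of the product in a short Python sketch. The function names `matmul` and `triple_sum` and the three matrices are assumptions chosen for the illustration.

```python
def matmul(A, B):
    """Product AB of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def triple_sum(A, B, C):
    """Element (i, k) is the double sum over l and j of a_il * b_lj * c_jk."""
    n = len(A)
    return [[sum(A[i][l] * B[l][j] * C[j][k] for l in range(n) for j in range(n))
             for k in range(n)]
            for i in range(n)]


A = [[1, 2], [3, 4]]      # illustrative square matrices of order 2
B = [[0, 1], [5, -2]]
C = [[2, 2], [-1, 3]]

left = matmul(matmul(A, B), C)    # (AB)C
right = matmul(A, matmul(B, C))   # A(BC)
A_cubed = matmul(matmul(A, A), A)
```

Both bracketings reproduce `triple_sum(A, B, C)`, which is why the symbols ABC and $A^3$ need no brackets.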

6.6. The summation convention. When once the principle
involved in the previous work has been grasped, it is best
to use the summation convention (Chapter III), whereby

\[ b_{ij}c_{jk} \quad \text{denotes} \quad \sum_{j=1}^{n} b_{ij}c_{jk}. \]

By extension,

\[ a_{il}b_{lj}c_{jk} \quad \text{denotes} \quad \sum_{l=1}^{n}\sum_{j=1}^{n} a_{il}b_{lj}c_{jk}, \]

and once the forms (1) and (2) of § 6.5 have been studied, it
becomes clear that any repeated suffix in such an expression is
a ‘dummy’ and may be replaced by any other; for example,

\[ a_{il}b_{lj}c_{jk} = a_{ij}b_{jl}c_{lk}. \]

Moreover, the use of the convention makes obvious the law
of formation of the elements of a product ABC...Z; this product
of matrices is, in fact,

\[ [a_{il}b_{lj}c_{jm} \dots z_{tk}], \]

the i, k being the only suffixes that are not dummies.

7. The commutative law for multiplication

7.1. The third law that governs the multiplication of complex
numbers is the commutative law, of which an example is
ab = ba. As we have seen (Theorem 22), this law does not hold
for matrices. There is, however, an important exception.

Definition 12. The square matrix of order n that has unity
in its leading diagonal places and zero elsewhere is called the
unit matrix of order n. It is denoted by I.

The number n of rows and columns in I is usually clear from
the context and it is rarely necessary to use distinct symbols
for unit matrices of different orders.

Let C be a square matrix of order n and I the unit matrix of
order n; then

\[ IC = CI = C. \]

Also

\[ I = I^2 = I^3 = \dots . \]

Hence I has the properties of unity in ordinary algebra. Just
as we replace 1 × x and x × 1 by x in ordinary algebra, so we
replace I × C and C × I by C in the algebra of matrices.

Further, if k is any number, real or complex, kI.C = C.kI
and each is equal to the matrix kC.

7.2. It may also be noted that if A is a matrix of n rows,
B a matrix of n columns, and I the unit matrix of order n,
then IA = A and BI = B, even though A and B are not
square matrices. If A is not square, A and I are not conformable
for the product AI.
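
The behaviour of the unit matrix, including the rectangular case of § 7.2, can be checked with the following Python sketch. The helper names `identity` and `matmul` and all numerical values are assumptions of the sketch.

```python
def identity(n):
    """Unit matrix of order n: unity on the leading diagonal, zero elsewhere."""
    return [[1 if i == k else 0 for k in range(n)] for i in range(n)]


def matmul(A, B):
    """Product AB, defined only when A has as many columns as B has rows."""
    if len(A[0]) != len(B):
        raise ValueError("matrices are not conformable for this product")
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


C = [[1, 2], [3, 4]]             # square, order 2
R = [[1, 2, 3], [4, 5, 6]]       # 2 rows, 3 columns: not square
I2, I3 = identity(2), identity(3)
kI = [[5 * e for e in row] for row in I2]   # the matrix kI with k = 5

try:
    matmul(R, I2)                # R has 3 columns, I2 has 2 rows
    RI2_defined = True
except ValueError:
    RI2_defined = False
```

Note that R is pre-multiplied by the unit matrix of order 2 but post-multiplied by the unit matrix of order 3; the same symbol I would serve for both in the text.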


8. The division law 

Ordinary algebra is governed also by the division law, which
states that when the product xy is zero, either x or y (or both)
must be zero. This law does not govern matrix products. Use
0 to denote the null matrix. Then A0 = 0A = 0, but the
equation AB = 0 does not necessarily imply that A or B is
the null matrix. For instance, if

\[ A = \begin{bmatrix} a & b \\ 0 & 0 \end{bmatrix}, \qquad B = \begin{bmatrix} b & 2b \\ -a & -2a \end{bmatrix}, \]

the product AB is the zero matrix, although neither A nor B
is the zero matrix.

Again, AB may be zero and BA not zero. For example, if

\[ A = \begin{bmatrix} a & b \\ 0 & 0 \end{bmatrix}, \qquad B = \begin{bmatrix} b & 0 \\ -a & 0 \end{bmatrix}, \]

\[ AB = \begin{bmatrix} 0 & 0 \\ 0 & 0 \end{bmatrix}, \qquad BA = \begin{bmatrix} ab & b^2 \\ -a^2 & -ab \end{bmatrix}. \]
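
The failure of the division law can be seen numerically in the Python sketch below, which takes a = 2 and b = 3. The second pair assumes B of the form [b, 0; −a, 0], a choice consistent with the products AB and BA stated above; the names `matmul`, `B1`, and `B2` are hypothetical.

```python
def matmul(A, B):
    """Product AB of two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*B)] for row in A]


a, b = 2, 3                       # illustrative values
A = [[a, b], [0, 0]]
B1 = [[b, 2 * b], [-a, -2 * a]]   # first example: A*B1 is the null matrix
B2 = [[b, 0], [-a, 0]]            # assumed form for the second example

zero = [[0, 0], [0, 0]]
AB1 = matmul(A, B1)
AB2 = matmul(A, B2)
B2A = matmul(B2, A)
print(AB1, AB2, B2A)
```

Neither factor is null in either example, yet `AB1` and `AB2` are null while `B2A` is not.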

9. Summary of previous sections 

We have shown that the symbols A, B,..., representing 
matrices, may be added, multiplied, and, in a large measure, 
manipulated as though they represented ordinary numbers. 
The points of difference between ordinary numbers a, b and 
matrices A, B are 

(i) whereas ab = ba, AB is not usually the same as BA;

(ii) whereas ab = 0 implies that either a or b (or both) is
zero, the equation AB = 0 does not necessarily imply
that either A or B is zero. We shall return to this point
in Theorem 29.

10. The determinant of a square matrix

10.1. When A is a square matrix, the determinant that has
the same elements as the matrix, and in the same places, is
called the determinant of the matrix. It is usually denoted by
|A|. Thus, if A has the element $a_{ik}$ in the ith row and kth
column, so that $A = [a_{ik}]$, then |A| denotes the determinant
$|a_{ik}|$.

It is an immediate consequence of Theorem 13, that if A and
B are square matrices of order n, and if AB is the product of
the two matrices, the determinant of the matrix AB is equal to the
product of the determinants of the matrices A and B; that is,

\[ |AB| = |A| \times |B|. \]

Equally,

\[ |BA| = |B| \times |A|. \]

Since |A| and |B| are numbers, the commutative law holds for
their product and |A| × |B| = |B| × |A|.

Thus |AB| = |BA|, although the matrix AB is not the same
as the matrix BA. This is due to the fact that the value of
a determinant is unaltered when rows and columns are interchanged,
whereas such an interchange does not leave a general
matrix unaltered.
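
The relation |AB| = |A| × |B| = |BA| may be checked on a small case with the Python sketch below. The determinant is evaluated by expansion along the first row, which is an implementation choice of this sketch (adequate for small orders), and the matrices are arbitrary illustrative values.

```python
def matmul(A, B):
    """Product AB of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def det(M):
    """Determinant of a square matrix by expansion along its first row."""
    if not M:
        return 1                      # determinant of the empty array, by convention
    return sum((-1) ** k * M[0][k] * det([row[:k] + row[k + 1:] for row in M[1:]])
               for k in range(len(M)))


A = [[1, 2], [3, 4]]
B = [[0, 1], [1, 0]]

AB = matmul(A, B)
BA = matmul(B, A)
print(det(A), det(B), det(AB), det(BA))
```

The matrices `AB` and `BA` differ, but their determinants coincide with the product `det(A) * det(B)`.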


10.2. The beginner should note that in the equation
|AB| = |A| × |B|, of § 10.1, both A and B are matrices. It is
not true that |kA| = k|A|, where k is a number.

For simplicity of statement, let us suppose A to be a square
matrix of order three and let us suppose that k = 2. The
theorem ‘|A + B| = |A| + |B|’
is manifestly false (Theorem 6, p. 15) and so |2A| ≠ 2|A|. The
true theorem is easily found. We have

\[ A = \begin{bmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{bmatrix}, \]

say, so that

\[ 2A = \begin{bmatrix} 2a_{11} & 2a_{12} & 2a_{13} \\ 2a_{21} & 2a_{22} & 2a_{23} \\ 2a_{31} & 2a_{32} & 2a_{33} \end{bmatrix} \quad \text{(Definition 8).} \]

Hence $|2A| = 2^3|A|$.

The same is evident on applying § 10.1 to the matrix product
2I × A, where 2I is twice the unit matrix of order three and so is

\[ \begin{bmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 2 \end{bmatrix}. \]

Examples VII 

Examples 1-4 are intended as a kind of mental arithmetic. 
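1. Find the matrix A + B when

\[ \text{(i)} \quad A = \begin{bmatrix} 1 & 2 \\ 3 & 4 \end{bmatrix}, \quad B = \begin{bmatrix} 5 & 6 \\ 7 & 8 \end{bmatrix}; \]

\[ \text{(ii)} \quad A = \begin{bmatrix} 1 & -1 \\ 2 & -2 \\ 3 & -3 \end{bmatrix}, \quad B = \begin{bmatrix} 3 & -3 \\ 4 & -4 \\ 5 & -5 \end{bmatrix}; \]

\[ \text{(iii)} \quad A = [1 \;\; 2 \;\; 3], \quad B = [4 \;\; 5 \;\; 6]. \]

Ans.

\[ \text{(i)} \; \begin{bmatrix} 6 & 8 \\ 10 & 12 \end{bmatrix}, \quad \text{(ii)} \; \begin{bmatrix} 4 & -4 \\ 6 & -6 \\ 8 & -8 \end{bmatrix}, \quad \text{(iii)} \; [5 \;\; 7 \;\; 9]. \]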

2. Is it possible to define the matrix A + B when

(i) A has 3 rows, B has 4 rows,

(ii) A has 3 columns, B has 4 columns,

(iii) A has 3 rows, B has 3 columns?

Ans. (i), (ii), No; (iii) only if A has 3 columns and B 3 rows.

3. Is it possible to define the matrix BA and the matrix AB when
A, B have the properties of Example 2?

Ans. AB (i) if A has 4 cols.; (ii) if B has 3 rows; (iii) depends on the
number of columns of A and the number of rows of B.

BA (i) if B has 3 cols.; (ii) if A has 4 rows; (iii) always.

4. Form the products AB and BA when 

'*“[! !]■ B -[l 3' '“G a- J 

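Examples 5-8 deal with quaternions in their matrix form.

5. If $\iota$ denotes $\sqrt{-1}$ and I, i, j, k are defined as

\[ I = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}, \quad i = \begin{bmatrix} \iota & 0 \\ 0 & -\iota \end{bmatrix}, \quad j = \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix}, \quad k = \begin{bmatrix} 0 & \iota \\ \iota & 0 \end{bmatrix}, \]

then ij = k, jk = i, ki = j; ji = −k, kj = −i, ik = −j.

Further, $i^2 = j^2 = k^2 = -I$.

6. If Q = aI + bi + cj + dk, Q' = aI − bi − cj − dk, a, b, c, d being
numbers, real or complex, then

\[ QQ' = (a^2 + b^2 + c^2 + d^2)I. \]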

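7. If $P = \alpha I + \beta i + \gamma j + \delta k$, $P' = \alpha I - \beta i - \gamma j - \delta k$, $\alpha, \beta, \gamma, \delta$ being
numbers, real or complex, then

\[ \begin{aligned} QPP'Q' &= Q\,.\,(\alpha^2 + \beta^2 + \gamma^2 + \delta^2)I\,.\,Q' \\ &= QQ'(\alpha^2 + \beta^2 + \gamma^2 + \delta^2)I && \text{[see § 7]} \\ &= (a^2 + b^2 + c^2 + d^2)I\,.\,(\alpha^2 + \beta^2 + \gamma^2 + \delta^2)I \\ &= (a^2 + b^2 + c^2 + d^2)(\alpha^2 + \beta^2 + \gamma^2 + \delta^2)I \\ &= Q'P'PQ. \end{aligned} \]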


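8. Prove the results of Example 5 when I, i, j, k are respectively

\[ \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}, \quad \begin{bmatrix} 0 & 1 & 0 & 0 \\ -1 & 0 & 0 & 0 \\ 0 & 0 & 0 & -1 \\ 0 & 0 & 1 & 0 \end{bmatrix}, \quad \begin{bmatrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ -1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \end{bmatrix}, \quad \begin{bmatrix} 0 & 0 & 0 & 1 \\ 0 & 0 & -1 & 0 \\ 0 & 1 & 0 & 0 \\ -1 & 0 & 0 & 0 \end{bmatrix}. \]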
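
A numerical check is no substitute for the proof asked for in Example 8, but it is a useful guard against slips. The Python sketch below multiplies out the four-rowed matrices I, i, j, k, taken here as the reconstruction [1 0 0 0; 0 1 0 0; 0 0 1 0; 0 0 0 1], [0 1 0 0; −1 0 0 0; 0 0 0 −1; 0 0 1 0], [0 0 1 0; 0 0 0 1; −1 0 0 0; 0 −1 0 0], [0 0 0 1; 0 0 −1 0; 0 1 0 0; −1 0 0 0] (rows separated by semicolons). The helper names `matmul` and `neg` are assumptions of the sketch.

```python
def matmul(A, B):
    """Product AB of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def neg(A):
    """The matrix -A."""
    return [[-a for a in row] for row in A]


I = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
i = [[0, 1, 0, 0], [-1, 0, 0, 0], [0, 0, 0, -1], [0, 0, 1, 0]]
j = [[0, 0, 1, 0], [0, 0, 0, 1], [-1, 0, 0, 0], [0, -1, 0, 0]]
k = [[0, 0, 0, 1], [0, 0, -1, 0], [0, 1, 0, 0], [-1, 0, 0, 0]]

cyclic = matmul(i, j) == k and matmul(j, k) == i and matmul(k, i) == j
reversed_order = (matmul(j, i) == neg(k) and matmul(k, j) == neg(i)
                  and matmul(i, k) == neg(j))
squares = all(matmul(q, q) == neg(I) for q in (i, j, k))
print(cyclic, reversed_order, squares)
```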


9. By considering the matrices

\[ I = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}, \qquad i = \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix}, \]

together with matrices aI + bi, aI − bi, where a, b are real numbers, show
that the usual complex number may be regarded as a sum of matrices
whose elements are real numbers. In particular, show that

\[ (aI + bi)(aI - bi) = (a^2 + b^2)I, \]

\[ (aI + bi)(cI + di) = (ac - bd)I + (ad + bc)i. \]

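10. Prove that

\[ [x \;\; y \;\; z] \begin{bmatrix} a & h & g \\ h & b & f \\ g & f & c \end{bmatrix} \begin{bmatrix} x \\ y \\ z \end{bmatrix} = [ax^2 + by^2 + cz^2 + 2fyz + 2gzx + 2hxy], \]

that is, the usual quadratic form as a matrix having only one row and
one column.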


11. Prove that the matrix equation

\[ A^2 - B^2 = (A - B)(A + B) \]

is true only if AB = BA and that $aA^2 + 2hAB + bB^2$ cannot, in general,
be written as the product of two linear factors unless AB = BA.


12. If $\lambda_1, \lambda_2$ are numbers, real or complex, and A is a square matrix
of order n, then

\[ A^2 - (\lambda_1 + \lambda_2)A + \lambda_1\lambda_2 I = (A - \lambda_1 I)(A - \lambda_2 I), \]

where I is the unit matrix of order n.

Hint. The R.H.S. is $A^2 - \lambda_2 AI - \lambda_1 IA + \lambda_1\lambda_2 I^2$ (by the distributive
law).

13. If $f(\lambda) = p_0\lambda^n + p_1\lambda^{n-1} + \dots + p_n$, where $p_r$, $\lambda$ are numbers, and if
f(A) denotes the matrix

\[ p_0A^n + p_1A^{n-1} + \dots + p_nI, \]

then

\[ f(A) = p_0(A - \lambda_1 I) \dots (A - \lambda_n I), \]

where $\lambda_1, \dots, \lambda_n$ are the roots of the equation $f(\lambda) = 0$.

14. If $B = \lambda A + \mu I$, where $\lambda$ and $\mu$ are numbers, then BA = AB.


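The use of submatrices

15. The matrix

\[ P = \begin{bmatrix} p_{11} & p_{12} & p_{13} \\ p_{21} & p_{22} & p_{23} \\ p_{31} & p_{32} & p_{33} \end{bmatrix} \]

may be denoted by

\[ \begin{bmatrix} P_{11} & P_{12} \\ P_{21} & P_{22} \end{bmatrix}, \]

where

\[ P_{11} \text{ denotes } \begin{bmatrix} p_{11} & p_{12} \\ p_{21} & p_{22} \end{bmatrix}, \qquad P_{12} \text{ denotes } \begin{bmatrix} p_{13} \\ p_{23} \end{bmatrix}, \]

\[ P_{21} \text{ denotes } [p_{31} \;\; p_{32}], \qquad P_{22} \text{ denotes } [p_{33}]. \]

Prove that, if Q, $Q_{11}$, etc., refer to like matrices with $q_{ik}$ instead of $p_{ik}$,
then

\[ P + Q = \begin{bmatrix} P_{11} + Q_{11} & P_{12} + Q_{12} \\ P_{21} + Q_{21} & P_{22} + Q_{22} \end{bmatrix}, \]

\[ PQ = \begin{bmatrix} P_{11}Q_{11} + P_{12}Q_{21} & P_{11}Q_{12} + P_{12}Q_{22} \\ P_{21}Q_{11} + P_{22}Q_{21} & P_{21}Q_{12} + P_{22}Q_{22} \end{bmatrix}. \]

Note. The first ‘column’ of PQ as it is written above is merely
a shorthand for the first two columns of the product PQ when it
is obtained directly from $[p_{ik}] \times [q_{ik}]$.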

16. Prove Example 8 by putting for 1 and $\iota$ in Example 5 the two-rowed
matrices of Example 9.

17. If each $P_{rs}$ and $Q_{rs}$ is a matrix of two rows and two columns,
prove that, when r = 1, 2, 3, s = 1, 2, 3,

\[ [P_{rs}] \times [Q_{rs}] = \Bigl[\sum_{t=1}^{3} P_{rt}Q_{ts}\Bigr]. \]

Elementary transformations of a matrix 

18. $I_{ij}$ is the matrix obtained by interchanging the ith and jth rows
of the unit matrix I. Prove that the effect of pre-multiplying a square
matrix A by $I_{ij}$ will be the interchange of two rows of A, and of post-multiplying
by $I_{ij}$, the interchange of two columns of A.

Deduce that $I_{ij}^2 = I$, $I_{ik}I_{kj}I_{ji} = I_{kj}$.

19. If $H = I + [h_{ij}]$, the unit matrix supplemented by an element h
in the position indicated, then HA affects A by replacing $\text{row}_i$ by
$\text{row}_i + h\,\text{row}_j$ and AH affects A by replacing $\text{col}_j$ by $\text{col}_j + h\,\text{col}_i$.

20. If H is a matrix of order n obtained from the unit matrix by 
replacing the rth unity in the principal diagonal by k , then HA is the 
result of multiplying the rth row of A by k, and AH is the result of 
multiplying the rth column of A by k. 
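
The three elementary transformations of Examples 18 to 20 can be tried out with the Python sketch below. Rows and columns are numbered from 0 in the code (not from 1 as in the text), and the function names `swap_matrix`, `shear_matrix`, and `scale_matrix` are hypothetical names introduced for the illustration.

```python
def identity(n):
    """Unit matrix of order n."""
    return [[1 if r == c else 0 for c in range(n)] for r in range(n)]


def matmul(A, B):
    """Product AB of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def swap_matrix(n, i, j):
    """Unit matrix with rows i and j interchanged (Example 18)."""
    M = identity(n)
    M[i], M[j] = M[j], M[i]
    return M


def shear_matrix(n, i, j, h):
    """Unit matrix supplemented by h in row i, column j, with i != j (Example 19)."""
    M = identity(n)
    M[i][j] = h
    return M


def scale_matrix(n, r, k):
    """Unit matrix with the r-th diagonal unity replaced by k (Example 20)."""
    M = identity(n)
    M[r][r] = k
    return M


A = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]   # illustrative matrix
S = swap_matrix(3, 0, 2)
H = shear_matrix(3, 0, 1, 10)
D = scale_matrix(3, 1, 5)
```

Pre-multiplication acts on the rows of A and post-multiplication on its columns, as the assertions that accompany this sketch confirm for each of the three kinds of matrix.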

Examples on matrix multiplication 

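21. Prove that the product of the two matrices

\[ \begin{bmatrix} \cos^2\theta & \cos\theta\sin\theta \\ \cos\theta\sin\theta & \sin^2\theta \end{bmatrix}, \qquad \begin{bmatrix} \cos^2\phi & \cos\phi\sin\phi \\ \cos\phi\sin\phi & \sin^2\phi \end{bmatrix} \]

is zero when $\theta$ and $\phi$ differ by an odd multiple of $\tfrac{1}{2}\pi$.

22. When $(\lambda_1, \lambda_2, \lambda_3)$ and $(\mu_1, \mu_2, \mu_3)$ are the direction cosines of two
lines l and m, prove that the product

\[ \begin{bmatrix} \lambda_1^2 & \lambda_1\lambda_2 & \lambda_1\lambda_3 \\ \lambda_1\lambda_2 & \lambda_2^2 & \lambda_2\lambda_3 \\ \lambda_1\lambda_3 & \lambda_2\lambda_3 & \lambda_3^2 \end{bmatrix} \begin{bmatrix} \mu_1^2 & \mu_1\mu_2 & \mu_1\mu_3 \\ \mu_1\mu_2 & \mu_2^2 & \mu_2\mu_3 \\ \mu_1\mu_3 & \mu_2\mu_3 & \mu_3^2 \end{bmatrix} \]

is zero if and only if the lines l and m are perpendicular.

23. Prove that $L^2 = L$ when L denotes the first matrix of Example 22.

RELATED MATRICES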

1. The transpose of a matrix 

1.1. Definition 13. If A is a matrix of n columns, the matrix
which, for r = 1, 2, ..., n, has the r-th column of A as its r-th row
is called the transpose of A (or the transposed matrix of A).
It is denoted by A'.

If A = A', A is said to be symmetrical.

The definition applies to all rectangular matrices. For
example,

\[ \begin{bmatrix} 1 & 2 \\ 3 & 4 \\ 5 & 6 \end{bmatrix} \quad \text{is the transpose of} \quad \begin{bmatrix} 1 & 3 & 5 \\ 2 & 4 & 6 \end{bmatrix}, \]

for the rows of the one are the columns of the other.

When we are considering square matrices of order n we shall,
throughout this chapter, denote the matrix by writing down
the element in the ith row and kth column. In this notation,
if $A = [a_{ik}]$, then $A' = [a_{ki}]$; for the element in the ith row
and kth column of A' is that in the kth row and ith column
of A.

1.2. Theorem 23. Law of reversal for a transpose. If
A and B are square matrices of order n,

\[ (AB)' = B'A'; \]

that is to say, the transpose of the product AB is the product of
the transposes in the reverse order.

In the notation indicated in § 1.1, let

\[ A = [a_{ik}], \qquad B = [b_{ik}], \]

so that

\[ A' = [a_{ki}], \qquad B' = [b_{ki}]. \]

Then, denoting a matrix by the element in the ith row and
kth column, and using the summation convention, we have

\[ AB = [a_{ij}b_{jk}], \qquad (AB)' = [a_{kj}b_{ji}]. \]

Now the ith row of B' is $b_{1i}, b_{2i}, \dots, b_{ni}$ and the kth column of
A' is $a_{k1}, a_{k2}, \dots, a_{kn}$; the inner product of the two is

\[ \sum_{j=1}^{n} b_{ji}a_{kj} = \sum_{j=1}^{n} a_{kj}b_{ji}. \]

Hence, on reverting to the use of the summation convention,
the product of the matrices B' and A' is given by

\[ B'A' = [b_{ji}a_{kj}] = [a_{kj}b_{ji}] = (AB)'. \]
Corollary. If A, B, ..., K are square matrices of order n, the
transpose of the product AB...K is the product K'...B'A'.

By the theorem,

\[ (ABC)' = C'(AB)' = C'B'A', \]

and so, step by step, for any number of matrices.
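
The law of reversal can be verified on particular matrices with the Python sketch below. The function names `transpose` and `matmul` and the numerical entries are assumptions of the sketch.

```python
def matmul(A, B):
    """Product AB of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def transpose(A):
    """Transpose A': the r-th column of A becomes the r-th row."""
    return [list(col) for col in zip(*A)]


A = [[1, 2], [3, 4]]       # illustrative square matrices of order 2
B = [[0, 1], [5, -2]]
C = [[2, 2], [-1, 3]]

lhs = transpose(matmul(A, B))                 # (AB)'
rhs = transpose_product = matmul(transpose(B), transpose(A))   # B'A'
wrong_order = matmul(transpose(A), transpose(B))               # A'B'
```

For these matrices `lhs` equals `rhs` but differs from `wrong_order`, so the reversal of the factors is essential.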

2. The Kronecker delta 

The symbol $\delta_{ik}$ is defined by the equations

\[ \delta_{ik} = 0 \text{ when } i \neq k, \qquad \delta_{ik} = 1 \text{ when } i = k. \]

It is a particular instance of a class of symbols that is extensively
used in tensor calculus. In matrix algebra it is often
convenient to use $[\delta_{ik}]$ to denote the unit matrix, which, as we
have already said, has 1 in the leading diagonal positions (when
i = k) and zero elsewhere.

3. The adjoint matrix and the reciprocal matrix 

3.1. Transformations as a guide to a correct definition. 

Let ‘numbers’ X and x be connected by the equation

\[ X = Ax, \tag{1} \]

where A symbolizes a matrix $[a_{ik}]$.

When $\Delta \equiv |a_{ik}| \neq 0$, x can be expressed uniquely in terms
of X (Chapter II, § 4.3); for when we multiply each equation
by the co-factor $A_{il}$ and sum for i = 1, 2, ..., n, we obtain

\[ \begin{aligned} \sum_{i=1}^{n} A_{il}X_i &= \sum_{k=1}^{n} x_k \sum_{i=1}^{n} a_{ik}A_{il} \\ &= \sum_{k=1}^{n} x_k\,\delta_{kl}\,\Delta && \text{(Theorem 10)} \\ &= x_l\,\Delta. \end{aligned} \]

That is,

\[ x_l = \sum_{i=1}^{n} (A_{il}/\Delta)X_i \]

or

\[ x_i = \sum_{k=1}^{n} (A_{ki}/\Delta)X_k. \tag{2a} \]

Thus we may write equations (2a) in the symbolic form

\[ x = A^{-1}X, \tag{2} \]

which is the natural complement of (1), provided we interpret
$A^{-1}$ to mean the matrix $[A_{ki}/\Delta]$.

3.2. Formal definitions. Let $A = [a_{ik}]$ be a square matrix
of order n; let $\Delta$, or |A|, denote the determinant $|a_{ik}|$; and let
$A_{rs}$ denote the co-factor of $a_{rs}$ in $\Delta$.

Definition 14. $A = [a_{ik}]$ is a singular matrix if $\Delta = 0$;
it is an ordinary matrix or a non-singular matrix if $\Delta \neq 0$.

Definition 15. The matrix $[A_{ki}]$ is the adjoint, or the
adjugate, matrix of $[a_{ik}]$.

Definition 16. When $[a_{ik}]$ is a non-singular matrix, the
matrix $[A_{ki}/\Delta]$ is the reciprocal matrix of $[a_{ik}]$.

Notice that, in the last two definitions, it is $A_{ki}$ and not $A_{ik}$
that is to be found in the ith row and kth column of the matrix.

3.3. Properties of the reciprocal matrix. The reason for
the name ‘the reciprocal matrix’ may be found in the properties
to be enunciated in Theorems 24 and 25.

Theorem 24. If $[a_{ik}]$ is a non-singular square matrix of order
n and if $A_{rs}$ is the co-factor of $a_{rs}$ in $\Delta = |a_{ik}|$, then

\[ [a_{ik}] \times [A_{ki}/\Delta] = I, \tag{1} \]

and

\[ [A_{ki}/\Delta] \times [a_{ik}] = I, \tag{2} \]

where I is the unit matrix of order n.

The element in the ith row and kth column of the product
$[a_{ik}] \times [A_{ki}/\Delta]$ is

\[ \frac{1}{\Delta}\sum_{j=1}^{n} a_{ij}A_{kj}, \]

which (by Theorem 10,† p. 30) is zero when i ≠ k and is unity
when i = k. Thus the product is $[\delta_{ik}] = I$. Hence (1) is proved.
Similarly, using now the summation convention, we have

\[ [A_{ki}/\Delta] \times [a_{ik}] = [A_{ji}a_{jk}/\Delta] = [\delta_{ik}], \]

and so (2) is proved.
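
Definitions 14 to 16 and Theorem 24 may be sketched in Python as follows. Exact rational arithmetic from the standard `fractions` module is used so that the products come out exactly equal to the unit matrix; the function names `det`, `adjoint`, and `reciprocal`, the expansion by the first row, and the numerical matrices are assumptions of the sketch.

```python
from fractions import Fraction


def matmul(A, B):
    """Product AB of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def det(M):
    """Determinant by expansion along the first row (adequate for small orders)."""
    if not M:
        return 1
    return sum((-1) ** k * M[0][k] * det([row[:k] + row[k + 1:] for row in M[1:]])
               for k in range(len(M)))


def adjoint(M):
    """Adjoint matrix: the element in row i, column k is the co-factor A_ki."""
    n = len(M)

    def cofactor(r, s):
        minor = [row[:s] + row[s + 1:] for idx, row in enumerate(M) if idx != r]
        return (-1) ** (r + s) * det(minor)

    return [[cofactor(k, i) for k in range(n)] for i in range(n)]


def reciprocal(M):
    """Reciprocal matrix [A_ki / Delta]; defined only for a non-singular matrix."""
    delta = det(M)
    if delta == 0:
        raise ValueError("singular matrix: the reciprocal cannot be formed")
    return [[Fraction(c, delta) for c in row] for row in adjoint(M)]


A = [[1, 2], [3, 4]]                       # Delta = -2
A3 = [[1, 2, 3], [0, 1, 4], [5, 6, 0]]     # Delta = 1
R, R3 = reciprocal(A), reciprocal(A3)
```

Note the transposition in `adjoint`: it is the co-factor $A_{ki}$, not $A_{ik}$, that occupies the ith row and kth column.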

3.4. In virtue of Theorem 24 we are justified, when A denotes
$[a_{ik}]$, in writing

\[ [A_{ki}/\Delta] = A^{-1}; \]

for we have shown by (1) and (2) of Theorem 24 that, with
such a notation,

\[ AA^{-1} = I \quad \text{and} \quad A^{-1}A = I. \]

That is to say, a matrix multiplied by its reciprocal is equal
to unity (the unit matrix).

But, though we have shown that $A^{-1}$ is a reciprocal of A,
we have not yet shown that it is the only reciprocal. This we
do in Theorem 25.

Theorem 25. If A is a non-singular square matrix, there is
only one matrix which, when multiplied by A, gives the unit
matrix.

Let R be any matrix such that AR = I and let $A^{-1}$ denote
the matrix $[A_{ki}/\Delta]$. Then, by Theorem 24, $AA^{-1} = I$, and hence

\[ A(R - A^{-1}) = AR - AA^{-1} = I - I = 0. \]

It follows that (note the next two steps)

\[ A^{-1}A(R - A^{-1}) = A^{-1}\,.\,0 = 0 \quad \text{(p. 75, § 6.5)}, \]

that is (by (2) of Theorem 24),

\[ I(R - A^{-1}) = 0, \quad \text{i.e.} \quad R - A^{-1} = 0 \quad \text{(p. 77, § 7.1)}. \]

Hence, if AR = I, R must be $A^{-1}$.

† For the precise form used here compare Example 6, p. 02.

Similarly, if RA = I, we have

\[ (R - A^{-1})A = RA - A^{-1}A = I - I = 0, \]

and so

\[ (R - A^{-1})AA^{-1} = 0\,.\,A^{-1} = 0; \]

that is

\[ (R - A^{-1})I = 0, \quad \text{i.e.} \quad R - A^{-1} = 0. \]

Hence, if RA = I, R must be $A^{-1}$.

3.5. In virtue of Theorem 25 we are justified in speaking of
$A^{-1}$ not merely as a reciprocal of A, but as the reciprocal of A.

Moreover, the reciprocal of the reciprocal of A is A itself, or,
in symbols,

\[ (A^{-1})^{-1} = A. \]

For since, by Theorem 24, $AA^{-1} = A^{-1}A = I$, it follows that
A is a reciprocal of $A^{-1}$ and, by Theorem 25, it must be the
reciprocal.

4. The index law for matrices 

We have already used the notations $A^2, A^3, \dots$ to stand for
AA, AAA, ... . When r and s are positive integers it is inherent
in the notation that

\[ A^r \times A^s = A^{r+s}. \]

When $A^{-1}$ denotes the reciprocal of A and s is a positive integer,
we use the notation $A^{-s}$ to denote $(A^{-1})^s$. With this notation
we may write

\[ A^r \times A^s = A^{r+s} \tag{1} \]

whenever r, s are positive or negative integers, provided that
$A^0$ is interpreted as I.

We shall not prove (1) in every case; a proof of (1) when
r > 0, s = −t, and t > r > 0 will be enough to show the
method.

Let t = r + k, where k > 0. Then

\[ A^rA^{-t} = A^rA^{-r-k} = A^rA^{-r}A^{-k}. \]

But

\[ AA^{-1} = I \quad \text{(Theorem 24)}, \]

and, when r > 1,

\[ A^rA^{-r} = A^{r-1}AA^{-1}A^{-r+1} = A^{r-1}IA^{-r+1} = A^{r-1}A^{-r+1}, \]

so that

\[ A^rA^{-r} = A^1A^{-1} = I. \]

Hence

\[ A^rA^{-t} = IA^{-k} = A^{-k} = A^{r-t}. \]

Similarly, we may prove that

\[ (A^r)^s = A^{rs} \tag{2} \]

whenever r and s are positive or negative integers.
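
The index laws (1) and (2) may be tried on a particular matrix with the Python sketch below. The function `power` is a hypothetical helper that forms $A^r$ by repeated multiplication, using the reciprocal when r is negative; the reciprocal is computed from the adjoint with exact fractions, and the matrix A is an arbitrary non-singular choice.

```python
from fractions import Fraction


def matmul(A, B):
    """Product AB of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def det(M):
    """Determinant by expansion along the first row."""
    if not M:
        return 1
    return sum((-1) ** k * M[0][k] * det([row[:k] + row[k + 1:] for row in M[1:]])
               for k in range(len(M)))


def reciprocal(M):
    """Reciprocal matrix [A_ki / Delta] of a non-singular matrix."""
    n, delta = len(M), det(M)
    if delta == 0:
        raise ValueError("singular matrix: the reciprocal cannot be formed")

    def cofactor(r, s):
        minor = [row[:s] + row[s + 1:] for idx, row in enumerate(M) if idx != r]
        return (-1) ** (r + s) * det(minor)

    return [[Fraction(cofactor(k, i), delta) for k in range(n)] for i in range(n)]


def power(A, r):
    """A^r for any integer r, with A^0 = I and A^(-s) = (A^(-1))^s."""
    n = len(A)
    base = A if r >= 0 else reciprocal(A)
    result = [[1 if i == k else 0 for k in range(n)] for i in range(n)]
    for _ in range(abs(r)):
        result = matmul(result, base)
    return result


A = [[2, 1], [1, 1]]   # illustrative non-singular matrix
```

With r = 2 and t = 3, for instance, `matmul(power(A, 2), power(A, -3))` reproduces `power(A, -1)`, as law (1) requires.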


5. The law of reversal for reciprocals 

Theorem 26. The reciprocal of a product of factors is the
product of the reciprocals of the factors in the reverse order; in
particular,

$$(AB)^{-1} = B^{-1}A^{-1},$$

$$(ABC)^{-1} = C^{-1}B^{-1}A^{-1},$$

$$(A^{-1}B^{-1})^{-1} = BA.$$

Proof. For a product of two factors, we have

$$(B^{-1}A^{-1}) \cdot (AB) = B^{-1}A^{-1}AB \quad \text{(p. 75, §6.5)}$$

$$= B^{-1}IB$$

$$= B^{-1}B = I \quad \text{(p. 77, §7.1)}.$$

Hence $B^{-1}A^{-1}$ is a reciprocal of AB and, by Theorem 25, it
must be the reciprocal of AB.

For a product of three factors, we then have

$$(C^{-1}B^{-1}A^{-1})(ABC) = C^{-1}IC$$

$$= C^{-1}C = I,$$

and so on for any number of factors.
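The law of reversal can be illustrated on 2 x 2 matrices. The sketch below is only an illustration under stated assumptions: the matrices A, B, C and the helper names `matmul` and `inverse2` are hypothetical and not part of the text.

```python
from fractions import Fraction

def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def inverse2(M):
    """Reciprocal of a non-singular 2x2 matrix: adjugate divided by determinant."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("a singular matrix has no reciprocal")
    return [[d / det, -b / det], [-c / det, a / det]]

# Hypothetical non-singular matrices chosen for illustration.
A = [[2, 1], [5, 3]]
B = [[1, 2], [3, 4]]
C = [[0, 1], [1, 1]]

lhs = inverse2(matmul(A, B))                  # (AB)^(-1)
rhs = matmul(inverse2(B), inverse2(A))        # B^(-1) A^(-1), reversed order
wrong = matmul(inverse2(A), inverse2(B))      # A^(-1) B^(-1), original order

lhs3 = inverse2(matmul(matmul(A, B), C))      # (ABC)^(-1)
rhs3 = matmul(matmul(inverse2(C), inverse2(B)), inverse2(A))

print(lhs == rhs, lhs == wrong, lhs3 == rhs3)
```

For these particular matrices the product taken in the unreversed order differs from the reciprocal of AB, which shows why the order of the factors matters.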


Theorem 27. The operations of reciprocating and transposing
are commutative; that is,

$$(A')^{-1} = (A^{-1})'.$$

Proof. The matrix $A^{-1}$ is, by definition,

$$[A_{ki}/\Delta],$$

so that $(A^{-1})' = [A_{ik}/\Delta]$.

Moreover, $A' = [a_{ki}]$

and so, by the definition of a product and the results of
Theorem 10,

$$A' \cdot (A^{-1})' = [\delta_{ik}] = I.$$

Hence $(A^{-1})'$ is a reciprocal of A' and, by Theorem 25, it must
be the reciprocal.
6. Singular matrices

If A is a singular matrix, its determinant is zero and so we
cannot form the reciprocal of A, though we can, of course, form
the adjugate, or adjoint, matrix. Moreover,

Theorem 28. The product of a singular matrix by its adjoint
is the zero matrix.

Proof. If $A = [a_{ik}]$ and $|a_{ik}| = 0$, then

$$\sum_{j=1}^{n} a_{ij}A_{kj} = 0, \qquad \sum_{j=1}^{n} A_{ji}a_{jk} = 0$$

both when i ≠ k and when i = k. Hence the products

$$[a_{ik}] \times [A_{ki}] \quad \text{and} \quad [A_{ki}] \times [a_{ik}]$$

are both of them null matrices.
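Theorem 28 can be checked on a concrete singular matrix. The following is a minimal sketch, assuming a Laplace expansion for the determinant; the function names `det`, `adjugate` and `matmul`, and the two sample matrices, are illustrative choices and not from the text.

```python
def det(M):
    """Determinant by expansion along the first row (fine for small orders)."""
    if not M:
        return 1  # empty determinant, needed for cofactors of a 1x1 matrix
    return sum((-1) ** j * M[0][j] * det([row[:j] + row[j + 1:] for row in M[1:]])
               for j in range(len(M)))

def adjugate(A):
    """Adjugate (adjoint) matrix [A_ki]: the transposed matrix of cofactors."""
    n = len(A)

    def cofactor(i, k):
        minor = [row[:k] + row[k + 1:] for r, row in enumerate(A) if r != i]
        return (-1) ** (i + k) * det(minor)

    # The element in row i, column k of the adjugate is the cofactor A_ki.
    return [[cofactor(k, i) for k in range(n)] for i in range(n)]

def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

S = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]   # hypothetical singular matrix
N = [[2, 0, 1], [1, 3, 0], [0, 1, 1]]   # hypothetical non-singular matrix

zero = [[0] * 3 for _ in range(3)]
print(det(S), matmul(S, adjugate(S)) == zero, matmul(adjugate(S), S) == zero)
print(det(N), matmul(N, adjugate(N)))   # for comparison: |N| times the unit matrix
```

The second matrix is included only for contrast: when the determinant is not zero, the same product gives the determinant times I, in line with Example 6 of Examples VIII.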

7. The division law for non-singular matrices 

7.1. Theorem 29. (i) If the matrix A is non-singular, the
equation AB = 0 implies B = 0.

(ii) If the matrix B is non-singular, the equation AB = 0
implies A = 0.

Proof. (i) Since the matrix A is non-singular, it has a reciprocal
$A^{-1}$, and $A^{-1}A = I$.

Since AB = 0, it follows that

$$A^{-1}AB = A^{-1} \times 0 = 0.$$

But $A^{-1}AB = IB = B$, and so B = 0.

(ii) Since B is non-singular, it has a reciprocal $B^{-1}$, and
$BB^{-1} = I$.

Since AB = 0, it follows that

$$ABB^{-1} = 0 \times B^{-1} = 0.$$

But $ABB^{-1} = AI = A$, and so A = 0.

7.2. The division law for matrices thus takes the form
'If AB = 0, then either A = 0, or B = 0, or both A and B
are singular matrices.'
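The third alternative in the division law does occur. As a small worked illustration (the two matrices are chosen here for the purpose and are not from the text), neither factor below is the zero matrix, both are singular, and their product is zero:

```latex
A = \begin{bmatrix} 1 & 0 \\ 0 & 0 \end{bmatrix}, \qquad
B = \begin{bmatrix} 0 & 0 \\ 0 & 1 \end{bmatrix}, \qquad
AB = \begin{bmatrix} 1\cdot 0 + 0\cdot 0 & 1\cdot 0 + 0\cdot 1 \\
                     0\cdot 0 + 0\cdot 0 & 0\cdot 0 + 0\cdot 1 \end{bmatrix}
   = \begin{bmatrix} 0 & 0 \\ 0 & 0 \end{bmatrix},
\qquad |A| = |B| = 0 .
```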
7.3. Theorem 30. If A is a given matrix and B is a non-singular
matrix (both being square matrices of order n), there is
one and only one matrix X such that

$$A = BX, \qquad (1)$$

and there is one and only one matrix Y such that

$$A = YB. \qquad (2)$$

Moreover, $X = B^{-1}A$, $Y = AB^{-1}$.

Proof. We see at once that $X = B^{-1}A$ satisfies (1), for
$BB^{-1}A = IA = A$. Moreover, if A = BR,

$$0 = BR - A = B(R - B^{-1}A),$$

and

$$0 = B^{-1}B(R - B^{-1}A) = I(R - B^{-1}A),$$

so that $R - B^{-1}A$ is the zero matrix.

Similarly, $Y = AB^{-1}$ is the only matrix that satisfies (2).

7.4. The theorem remains true in certain circumstances when 
A and B are not square matrices of the same order. 

Corollary. If A is a matrix of m rows and n columns, B
a non-singular square matrix of order m, then

$$A = BX$$

has the unique solution $X = B^{-1}A$.

If A is a matrix of m rows and n columns, C a non-singular
square matrix of order n, then

$$A = YC$$

has the unique solution $Y = AC^{-1}$.

Since $B^{-1}$ has m columns and A has m rows, $B^{-1}$ and A are
conformable for the product $B^{-1}A$, which is a matrix of m rows
and n columns. Also

$$B \cdot B^{-1}A = IA = A,$$

where I is the unit matrix of order m.

Further, if A = BR, R must be a matrix of m rows and
n columns, and the first part of the corollary may be proved
as we proved the theorem.

The second part of the corollary may be proved in the same
way.

7.5. The corollary is particularly useful in writing down
the solution of linear equations. If $A = [a_{ik}]$ is a non-singular
matrix of order n, the set of equations

$$b_i = \sum_{k=1}^{n} a_{ik}x_k \quad (i = 1, \ldots, n) \qquad (3)$$

may be written in the matrix form b = Ax, where b, x stand
for matrices with a single column.

The set of equations

$$b_i = \sum_{k=1}^{n} a_{ki}y_k \quad (i = 1, \ldots, n) \qquad (4)$$

may be written in the matrix form b' = y'A, where b', y' stand
for matrices with a single row.

The solutions of the sets of equations are

$$x = A^{-1}b, \qquad y' = b'A^{-1}.$$

An interesting application is given in Example 11, p. 92. 
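The two solution formulas of §7.5 can be tried on a small system. This is a sketch under stated assumptions: the matrix A, the numbers in b, and the helper names `matmul`, `identity` and `inverse` are hypothetical and serve only to illustrate $x = A^{-1}b$ and $y' = b'A^{-1}$.

```python
from fractions import Fraction

def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def identity(n):
    """Unit matrix of order n."""
    return [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]

def inverse(A):
    """Reciprocal of a non-singular square matrix (Gauss-Jordan elimination)."""
    n = len(A)
    M = [[Fraction(x) for x in row] + identity(n)[i] for i, row in enumerate(A)]
    for c in range(n):
        p = next(r for r in range(c, n) if M[r][c] != 0)  # fails if A is singular
        M[c], M[p] = M[p], M[c]
        pivot = M[c][c]
        M[c] = [x / pivot for x in M[c]]
        for r in range(n):
            if r != c and M[r][c] != 0:
                f = M[r][c]
                M[r] = [x - f * y for x, y in zip(M[r], M[c])]
    return [row[n:] for row in M]

A = [[2, 1], [5, 3]]        # hypothetical non-singular matrix
b_col = [[1], [2]]          # b as a single-column matrix
b_row = [[1, 2]]            # b' as a single-row matrix

x = matmul(inverse(A), b_col)       # solves b = Ax, equations (3)
y_row = matmul(b_row, inverse(A))   # solves b' = y'A, equations (4)

print(x, y_row)
print(matmul(A, x) == b_col, matmul(y_row, A) == b_row)
```

The same right-hand sides give different unknowns in the two systems because (3) uses the rows of A as coefficients while (4) uses its columns.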


Examples VIII 


1. When

$$A = \begin{bmatrix} a & h & g \\ h & b & f \\ g & f & c \end{bmatrix}, \qquad
B = \begin{bmatrix} x_1 & x_2 & x_3 \\ y_1 & y_2 & y_3 \\ z_1 & z_2 & z_3 \end{bmatrix},$$

form the products AB, BA, A'B', B'A' and verify the result enunciated
in Theorem 23, p. 83.

Hint. Use X = ax+hy+gz, Y = hx+by+fz, Z = gx+fy+cz.

2. Form these products when A , B are square matrices of order 2, 
neither being symmetrical. 

3. When A, B are the matrices of Example 1, form the matrices $A^{-1}$, $B^{-1}$
and verify by direct evaluations that $(AB)^{-1} = B^{-1}A^{-1}$.

4. Prove Theorem 27, that $(A')^{-1} = (A^{-1})'$, by considering Theorem
23 in the particular case when $B = A^{-1}$.

5. Prove that the product of a matrix by its transpose is a sym- 
metrical matrix. 

Hint. Use Theorem 23 and consider the transpose of the matrix AA'. 

6. The product of the matrix $[a_{ik}]$ by its adjoint is equal to $|a_{ik}| I$.

Use of submatrices (compare Example 15, p. 81)


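7. Let

$$L = \begin{bmatrix} \lambda & 1 \\ 0 & \lambda \end{bmatrix}, \qquad
M = \begin{bmatrix} \mu & 1 \\ 0 & \mu \end{bmatrix}, \qquad
N = [\nu],$$

where λ, μ, ν are non-zero numbers, and let

$$A = \begin{bmatrix} L & 0 & 0 \\ 0 & M & 0 \\ 0 & 0 & N \end{bmatrix}.$$

Prove that

$$A^2 = \begin{bmatrix} L^2 & 0 & 0 \\ 0 & M^2 & 0 \\ 0 & 0 & N^2 \end{bmatrix}, \qquad
A^{-1} = \begin{bmatrix} L^{-1} & 0 & 0 \\ 0 & M^{-1} & 0 \\ 0 & 0 & N^{-1} \end{bmatrix}.$$

Hint. In view of Theorem 25, the last part is proved by shewing
that the product of A and the last matrix is I.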

8. Prove that if $f(A) = p_0A^n + p_1A^{n-1} + \ldots + p_n$, where $p_0, \ldots, p_n$ are
numbers, then f(A) may be written as a matrix

$$\begin{bmatrix} f(L) & 0 & 0 \\ 0 & f(M) & 0 \\ 0 & 0 & f(N) \end{bmatrix}$$

and that, when f(A) is non-singular, $\{f(A)\}^{-1}$ or 1/f(A) can be written
in the same matrix form, but with 1/f(L), 1/f(M), 1/f(N) instead of
f(L), f(M), f(N).

9. If f(A), g(A) are two polynomials in A and if g(A) is non-singular,
prove the extension of Example 8 in which f is replaced by f/g.


Miscellaneous exercises 

10. Solve equations (4), p. 91, in the two forms

$$y = (A')^{-1}b, \qquad y' = b'A^{-1}$$

and then, using Theorem 23, prove that

$$(A')^{-1} = (A^{-1})'.$$

11. A is a non-singular matrix of order n, x and y are single-column
matrices with n rows, l' and m' are single-row matrices with n columns;

$$y = Ax, \qquad l'x = m'y.$$

Prove that $m' = l'A^{-1}$, $m = (A^{-1})'l$.

Interpret this result when a linear transformation

$$y_i = \sum_{k=1}^{3} a_{ik}x_k \quad (i = 1, 2, 3)$$

changes $l_1x_1 + l_2x_2 + l_3x_3$ into $m_1y_1 + m_2y_2 + m_3y_3$ and
$(x_1, x_2, x_3)$, $(y_1, y_2, y_3)$ are regarded as homogeneous point-coordinates in
a plane.

12. If the elements $a_{ik}$ of A are real, and if AA' = 0, then A = 0.

13. The elements $a_{ik}$ of A are complex, the elements $\bar{a}_{ik}$ of $\bar{A}$ are the
conjugate complexes of $a_{ik}$, and $A\bar{A}' = 0$. Prove that $A = \bar{A} = 0$.

14. A and B are square matrices of the same order; A is symmetrical.
Prove that B'AB is symmetrical. [Chapter X, §4.1, contains a proof



CHAPTER VIII 
THE RANK OF A MATRIX 

[Section 6 is of less general interest than the remainder of the chapter. 
It may well be omitted on a first reading.] 

1. Definition of rank

1.1. The minors of a matrix. Suppose we are given any
matrix A, whether square or not. From this matrix delete all
elements save those in a certain r rows and r columns. When
r > 1, the elements that remain form a square matrix of order
r and the determinant of this matrix is called a minor of A of
order r. Each individual element of A is, when considered
in this connexion, called a minor of A of order 1. For
example, the determinant

$$\begin{vmatrix} a & c \\ g & i \end{vmatrix}$$

is a minor of

$$\begin{bmatrix} a & b & c \\ d & e & f \\ g & h & i \end{bmatrix}$$

of order 2; a, b,... are minors of order 1.

1.2. Now any minor of order r+1 (r ≥ 1) can be expanded
by its first row (Theorem 1) and so can be written as a sum of
multiples of minors of order r. Hence, if every minor of order r
is zero, then every minor of order r+1 must also be zero.

The converse is not true; for instance, in the example given
in §1.1, the only minor of order 3 is the determinant that contains
all the elements of the matrix, and this is zero if a = b,
d = e, and g = h; but the minor ai-gc, of order 2, is not
necessarily zero.

1.3. Rank. Unless every single element of a matrix A is
zero, there will be at least one set of minors of the same order
which do not all vanish. For any given matrix with m rows and
n columns, other than the matrix with every element zero,
there is, by the argument of §1.2, a definite positive integer p
such that

either p is less than both m and n, not all minors of order
p vanish, but all minors of order p+1 vanish,

or p is equal to† min(m, n), not all minors of order p vanish,
and no minor of order p+1 can be formed from the matrix.

This number p is called the rank of the matrix. But the
definition can be made much more compact and we shall take
as our working definition the following:

Definition 17. A matrix has rank r when r is the largest
integer for which the statement 'not all minors of order r are
zero' is valid.

For a non-singular square matrix of order n, the rank is n;
for a singular square matrix of order n, the rank is less than n.

It is sometimes convenient to consider the null matrix, all
of whose elements are zero, as being of zero rank.
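Definition 17 can be turned directly into a (slow but literal) computation: try the largest possible order first and stop at the first order for which some minor is not zero. The sketch below assumes small matrices with integer or rational entries; the names `det` and `rank_by_minors` and the sample matrices are illustrative, not from the text.

```python
from itertools import combinations

def det(M):
    """Determinant by expansion along the first row (adequate for small orders)."""
    if len(M) == 1:
        return M[0][0]
    return sum((-1) ** j * M[0][j] * det([row[:j] + row[j + 1:] for row in M[1:]])
               for j in range(len(M)))

def rank_by_minors(A):
    """Largest r for which not all minors of order r are zero (Definition 17)."""
    m, n = len(A), len(A[0])
    for r in range(min(m, n), 0, -1):
        for rows in combinations(range(m), r):
            for cols in combinations(range(n), r):
                # Minor formed from the chosen r rows and r columns.
                if det([[A[i][j] for j in cols] for i in rows]) != 0:
                    return r
    return 0  # the null matrix is taken to be of zero rank

print(rank_by_minors([[1, 2, 3], [4, 5, 6], [7, 8, 9]]))   # singular, order 3
print(rank_by_minors([[1, 0], [0, 1], [1, 1]]))            # 3 rows, 2 columns
print(rank_by_minors([[0, 0], [0, 0]]))                    # null matrix
```

Because every minor of every order may have to be examined, this is practical only for small matrices; the devices of Examples IX reduce the labour.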


2. Linear dependence 

2.1. In the remainder of this chapter we shall be concerned 
with linear dependence and its contrary, linear independence; 
in particular, we shall be concerned with the possibility of 
expressing one row of a matrix as a sum of multiples of certain 
other rows. We first make precise the meanings we shall attach 
to these terms. 

Let $a_1, \ldots, a_m$ be 'numbers', in the sense of Chapter VI, each
having n components, real or complex numbers. Let the components be

$$(a_{11}, \ldots, a_{1n}), \quad \ldots, \quad (a_{m1}, \ldots, a_{mn}).$$

Let F be any field‡ of numbers. Then $a_1, \ldots, a_m$ are said to be
linearly dependent with respect to F if there is an equation

$$l_1a_1 + \ldots + l_ma_m = 0, \qquad (1)$$

wherein all the l's belong to F and not all of them are zero.
The equation (1) implies the n equations

$$l_1a_{1r} + \ldots + l_ma_{mr} = 0 \quad (r = 1, \ldots, n).$$

The contrary of linear dependence with respect to F is linear
independence with respect to F.

The 'number' $a_1$ is said to be a sum of multiples of the
'numbers' $a_2, \ldots, a_m$ with respect to F if there is an equation of
the form

$$a_1 = l_2a_2 + \ldots + l_ma_m, \qquad (2)$$

wherein all the l's belong to F. The equation (2) implies the
n equations

$$a_{1r} = l_2a_{2r} + \ldots + l_ma_{mr} \quad (r = 1, \ldots, n).$$

† When m = n, min(m, n) = m = n; when m ≠ n, min(m, n) = the smaller of m and n.

‡ Compare Preliminary Note, p. 2.

2.2. Unless the contrary is stated we shall take F to be the
field of all numbers, real or complex. Moreover, we shall use
the phrases 'linearly dependent', 'linearly independent', 'is a
sum of multiples' to imply that each property holds with
respect to the field of all numbers, real or complex. For
example, 'the "number" a is a sum of multiples of the "numbers"
b and c' will imply an equation of the form $a = l_1b + l_2c$, where
$l_1$ and $l_2$ are not necessarily integers but may be any numbers,
real or complex.

As on previous occasions, we shall allow ourselves a wide
interpretation of what we shall call a 'number'; for example, in
Theorem 31, which follows, we consider each row of a matrix
as a 'number'.


3. Rank and linear dependence 
3.1. Theorem 31. If A is a matrix of rank r, where r ≥ 1,
and if A has more than r rows, we can select r rows of A and
express every other row as the sum of multiples of these r rows.

Let A have m rows and n columns. Then there are two
possibilities to be considered: either (i) r = n and m > n, or
(ii) r < n and, also, r < m.

(i) Let r = n and m > n.

There is at least one non-zero determinant of order n that
can be formed from A. Taking letters for columns and suffixes
for rows, let us label the elements of A so that one such non-zero
determinant is denoted by

$$\Delta = \begin{vmatrix} a_1 & b_1 & \ldots & k_1 \\ a_2 & b_2 & \ldots & k_2 \\ \vdots & \vdots & & \vdots \\ a_n & b_n & \ldots & k_n \end{vmatrix}.$$

With this labelling, let n+θ be the suffix of any other row.
Then (Chap. II, §4.3) the n equations

$$a_1l_1 + a_2l_2 + \ldots + a_nl_n = a_{n+\theta},$$
$$b_1l_1 + b_2l_2 + \ldots + b_nl_n = b_{n+\theta},$$
$$\ldots$$
$$k_1l_1 + k_2l_2 + \ldots + k_nl_n = k_{n+\theta}$$

have a unique solution in $l_1, \ldots, l_n$ given by

$$l_1 = (a_{n+\theta}\, b_2 \ldots k_n)/(a_1 b_2 \ldots k_n),$$
$$l_2 = (a_1\, b_{n+\theta}\, c_3 \ldots k_n)/(a_1 b_2 \ldots k_n),$$

and so on. Hence the row† with suffix n+θ can be expressed
as a sum of multiples of the n rows of A that occur in Δ. This
proves the theorem when r = n < m.

Notice that the argument breaks down if Δ = 0.
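A small worked instance of case (i) may help. The matrix below is chosen here for illustration only and does not come from the text; it has m = 3 rows, n = 2 columns and rank 2, with letters a, b for the columns and suffixes 1, 2, 3 for the rows.

```latex
A = \begin{bmatrix} 1 & 2 \\ 3 & 4 \\ 5 & 6 \end{bmatrix}, \qquad
\Delta = \begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix}
       = \begin{vmatrix} 1 & 2 \\ 3 & 4 \end{vmatrix} = -2 \neq 0 .

% The equations for the remaining row (suffix 3):
%   a_1 l_1 + a_2 l_2 = a_3 :   l_1 + 3 l_2 = 5
%   b_1 l_1 + b_2 l_2 = b_3 : 2 l_1 + 4 l_2 = 6
l_1 = \frac{(a_3\, b_2)}{(a_1\, b_2)}
    = \frac{\begin{vmatrix} 5 & 6 \\ 3 & 4 \end{vmatrix}}{-2}
    = \frac{2}{-2} = -1, \qquad
l_2 = \frac{(a_1\, b_3)}{(a_1\, b_2)}
    = \frac{\begin{vmatrix} 1 & 2 \\ 5 & 6 \end{vmatrix}}{-2}
    = \frac{-4}{-2} = 2 .

% Check: the third row is this sum of multiples of the first two rows.
(5,\ 6) = -1\cdot(1,\ 2) + 2\cdot(3,\ 4) .
```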

(ii) Let r < n and, also, r < m.

There is at least one non-zero minor of order r. As in (i), take
letters for columns and suffixes for rows, and label the elements
of A so that one non-zero minor of order r is denoted by

$$M = \begin{vmatrix} a_1 & b_1 & \ldots & p_1 \\ a_2 & b_2 & \ldots & p_2 \\ \vdots & \vdots & & \vdots \\ a_r & b_r & \ldots & p_r \end{vmatrix}.$$

With this labelling, let the remaining rows of A have suffixes
r+1, r+2,..., m.

Consider now the determinant

$$D = \begin{vmatrix} a_1 & \ldots & p_1 & \alpha_1 \\ \vdots & & \vdots & \vdots \\ a_r & \ldots & p_r & \alpha_r \\ a_{r+\theta} & \ldots & p_{r+\theta} & \alpha_{r+\theta} \end{vmatrix},$$

where θ is any integer from 1 to m-r and where α is the letter
of any column of A.

If α is one of the letters a,..., p, then D has two columns
identical and so is zero.

If α is not one of the letters a,..., p, then ±D is a minor of
A of order r+1; and since the rank of A is r, D is again zero.

† Using $\rho_t$ to denote the row whose letters have suffix t, we may write

Hence D is zero when α is the letter of any column of A,
and if we expand D by its last column, we obtain the equation

$$M\alpha_{r+\theta} + \lambda_1\alpha_1 + \lambda_2\alpha_2 + \ldots + \lambda_r\alpha_r = 0, \qquad (2)$$

where M ≠ 0 and where $M, \lambda_1, \ldots, \lambda_r$ are all independent of α.
Hence (2) expresses the row with suffix r+θ as a sum of
multiples of the r rows in M; symbolically,

$$\rho_{r+\theta} = -\frac{\lambda_1}{M}\rho_1 - \ldots - \frac{\lambda_r}{M}\rho_r.$$

This proves the theorem when r is less than m and less than n.

Notice that the argument breaks down if M = 0.

3.2. Theorem 32. If A is a matrix of rank r, it is impossible
to select q rows of A, where q < r, and to express every other row
as a sum of multiples of these q rows.

Suppose that, contrary to the theorem, we can select q rows
of A and then express every other row as a sum of multiples
of them.

Using suffixes for rows and letters for columns, let us label
the elements of A so that the selected q rows are denoted by
$\rho_1, \ldots, \rho_q$. Then, for k = 1,..., m, where m is the number of
rows in A, there are constants $\lambda_{tk}$ such that

$$\rho_k = \lambda_{1k}\rho_1 + \ldots + \lambda_{qk}\rho_q; \qquad (3)$$

for if k = 1,..., q, (3) is satisfied when $\lambda_{tk} = \delta_{tk}$,
and if k > q, (3) expresses our hypothesis in symbols.

Now consider any (arbitrary) minor of A of order r. Let the
letters of its columns be α,..., θ and let the suffixes of its rows
be $k_1, \ldots, k_r$. Then this minor is

$$\begin{vmatrix} \lambda_{1k_1}\alpha_1 + \ldots + \lambda_{qk_1}\alpha_q & \ldots & \lambda_{1k_1}\theta_1 + \ldots + \lambda_{qk_1}\theta_q \\ \vdots & & \vdots \\ \lambda_{1k_r}\alpha_1 + \ldots + \lambda_{qk_r}\alpha_q & \ldots & \lambda_{1k_r}\theta_1 + \ldots + \lambda_{qk_r}\theta_q \end{vmatrix},$$

which is the product by rows of the two determinants

$$\begin{vmatrix} \lambda_{1k_1} & \ldots & \lambda_{qk_1} & 0 & \ldots & 0 \\ \vdots & & \vdots & \vdots & & \vdots \\ \lambda_{1k_r} & \ldots & \lambda_{qk_r} & 0 & \ldots & 0 \end{vmatrix}, \qquad
\begin{vmatrix} \alpha_1 & \ldots & \alpha_q & 0 & \ldots & 0 \\ \vdots & & \vdots & \vdots & & \vdots \\ \theta_1 & \ldots & \theta_q & 0 & \ldots & 0 \end{vmatrix},$$

each of which has r-q columns of zeros.

Accordingly, if our supposition were true, every minor of A
of order r would be zero. But this is not so, for since the rank
of A is r, there is at least one minor of A of order r that is not
zero. Hence our supposition is not true and the theorem is
established.

4. Non-homogeneous linear equations

4.1. We now consider

$$\sum_{t=1}^{n} a_{it}x_t = b_i \quad (i = 1, \ldots, m), \qquad (1)$$

a set of m linear equations in n unknowns, $x_1, \ldots, x_n$. Such a set
of equations may either be consistent, that is, at least one set of
values of x may be found to satisfy all the equations, or they
may be inconsistent, that is, no set of values of x can be found
to satisfy all the equations. To determine whether the equations
are consistent or inconsistent we consider the matrices

$$A = \begin{bmatrix} a_{11} & \ldots & a_{1n} \\ \vdots & & \vdots \\ a_{m1} & \ldots & a_{mn} \end{bmatrix}, \qquad
B = \begin{bmatrix} a_{11} & \ldots & a_{1n} & b_1 \\ \vdots & & \vdots & \vdots \\ a_{m1} & \ldots & a_{mn} & b_m \end{bmatrix}.$$

We call B the augmented matrix of A.

Let the ranks of A, B be r, r' respectively. Then, since every
minor of A is a minor of B, either r = r' or r < r' [if A has
a non-zero minor of order r, then so has B; but the converse
is not necessarily true].

We shall prove that the equations are consistent when r = r'
and are inconsistent when r < r'.

4.2. Let r < r'. Since r' ≤ m, r < m. We can select r rows
of A and express every other row of A as a sum of multiples
of these r rows.

Let us number the equations (1) so that the selected r rows
become the rows with suffixes 1,..., r. Then we have equations
of the form

$$a_{r+\theta,t} = \sum_{i=1}^{r} \lambda_{\theta i}a_{it} \quad (t = 1, \ldots, n), \qquad (2)$$

wherein the λ's are independent of t.

Let us now make the hypothesis that the equations (1) are
consistent. Then, on multiplying (2) by $x_t$ and summing from
t = 1 to t = n, we get

$$b_{r+\theta} = \sum_{i=1}^{r} \lambda_{\theta i}b_i. \qquad (3)$$

The equations (2) and (3) together imply that we can express
any row of B as a sum of multiples of the first r rows, which
is contrary to the fact that the rank of B is r' (Theorem 32).
Hence the equations (1) cannot be consistent.

4.3. Let r' = r. If r = m, (1) may be written as (5) below.

If r < m, we can select r rows of B and express every other
row as a sum of multiples of these r rows. As before, number
the equations so that these r rows correspond to i = 1,..., r.
Then every set of x that satisfies

$$\sum_{t=1}^{n} a_{it}x_t = b_i \quad (i = 1, \ldots, r) \qquad (4)$$

satisfies all the equations of (1).

Now let a denote the matrix of the coefficients of x in (4).
Then, as we shall prove, at least one minor of a of order r must
have a value distinct from zero. For, since the kth row of A
can be written as

$$\lambda_{1k}\rho_1 + \ldots + \lambda_{rk}\rho_r,$$

every minor of A of order r is the product of a determinant
$|\lambda_{ik}|$ by a determinant $|a_{ik}|$, wherein i has the values 1,..., r
(put q = r in the work of §3.2); that is, every minor of A of
order r has a minor of a of order r as a factor. But at least
one minor of A of order r is not zero and hence one minor of
a of order r is not zero.

Let the suffixes of the variables x be numbered so that the
first r columns of a yield a non-zero minor of order r. Then,
in this notation, the equations (4) become

$$\sum_{t=1}^{r} a_{it}x_t = b_i - \sum_{t=r+1}^{n} a_{it}x_t \quad (i = 1, \ldots, r), \qquad (5)$$

wherein the determinant of the coefficients on the left-hand
side is not zero. The summation on the right-hand side is
present only when n > r.

If n > r, we may give $x_{r+1}, \ldots, x_n$ arbitrary values and then
solve equations (5) uniquely for $x_1, \ldots, x_r$. Hence, if r = r' and
n > r, the equations (1) are consistent and certain sets of n-r
of the variables x can be given arbitrary values.†

If n = r, the equations (5), and so also the equations (1), are
consistent and have a unique solution. This unique solution
may be written symbolically (Chap. VII, §7.5) as

$$x = A_1^{-1}b,$$

where $A_1$ is the matrix of the coefficients on the left-hand side
of (5), b is the single-column matrix with elements $b_1, \ldots, b_n$,
and x is the single-column matrix with elements $x_1, \ldots, x_n$.
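The criterion of §4 (compare the rank r of A with the rank r' of the augmented matrix B) is easy to try out. The sketch below is an illustration under stated assumptions: it finds ranks by row reduction rather than by examining minors, relying on the fact that the changes listed in Example 1 of Examples IX leave the rank unaltered; the function names `rank` and `classify` and the sample systems are hypothetical.

```python
from fractions import Fraction

def rank(M):
    """Rank found by row reduction with exact fractions."""
    M = [[Fraction(x) for x in row] for row in M]
    rows, cols = len(M), len(M[0])
    rk = 0
    for c in range(cols):
        # Look for a non-zero entry in column c at or below row rk.
        p = next((r for r in range(rk, rows) if M[r][c] != 0), None)
        if p is None:
            continue
        M[rk], M[p] = M[p], M[rk]
        for r in range(rk + 1, rows):
            f = M[r][c] / M[rk][c]
            M[r] = [x - f * y for x, y in zip(M[r], M[rk])]
        rk += 1
        if rk == rows:
            break
    return rk

def classify(A, b):
    """Compare the ranks r of A and r' of the augmented matrix B."""
    B = [row + [bi] for row, bi in zip(A, b)]
    r, r_aug = rank(A), rank(B)
    n = len(A[0])
    if r < r_aug:
        return "inconsistent"
    if r == n:
        return "unique solution"
    return f"consistent, {n - r} arbitrary"

print(classify([[1, 1], [2, 2]], [1, 3]))           # r = 1, r' = 2
print(classify([[1, 1], [2, 2]], [1, 2]))           # r = r' = 1 < n = 2
print(classify([[1, 1], [1, -1]], [2, 0]))          # r = r' = n = 2
print(classify([[1, 0], [0, 1], [1, 1]], [1, 2, 3]))
```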

5. Homogeneous linear equations 
The equations

$$\sum_{t=1}^{n} a_{it}x_t = 0 \quad (i = 1, \ldots, m), \qquad (6)$$

which form a set of m homogeneous equations in n unknowns
$x_1, \ldots, x_n$, may be considered as an example of equations (1)
when all the $b_i$ are zero. The results of §4 are immediately
applicable.

Since all $b_i$ are zero, the rank of B is equal to the rank of A,
and so equations (6) are always consistent. But if r, the rank
of A, is equal to n, then, in the notation of §4.3, the only
solution of equations (6) is given by

$$x = A_1^{-1}b, \qquad b = 0.$$

Hence, when r = n the only solution of (6) is given by x = 0,
that is, $x_1 = x_2 = \ldots = x_n = 0$.

When r < n we can, as when we obtained equations (5) of
§4.3, reduce the solution of (6) to the solution of

$$\sum_{t=1}^{r} a_{it}x_t = -\sum_{t=r+1}^{n} a_{it}x_t \quad (i = 1, \ldots, r), \qquad (7)$$

wherein the determinant of the coefficients on the left-hand
side is not zero.‡ In this case, all solutions of (6) are given by
assigning arbitrary values to $x_{r+1}, \ldots, x_n$ and then solving (7) for
$x_1, \ldots, x_r$. Once the values of $x_{r+1}, \ldots, x_n$ have been assigned, the
equations (7) determine $x_1, \ldots, x_r$ uniquely.

In particular, if there are more variables than equations,
i.e. n > m, then r < n and the equations have a solution other
than $x_1 = \ldots = x_n = 0$.

† Not all sets of n-r of the variables may be given arbitrary values; the
criterion is that the coefficients of the remaining r variables should provide
a non-zero determinant of order r.

‡ The notation of (7) is not necessarily that of (6): the labelling of the
elements of A and the numbering of the variables x has followed the procedure
laid down in §4.3.


6. Fundamental sets of solutions 
6.1. When r, the rank of A, is less than n, we obtain n-r
distinct solutions of equations (6) by solving equations (7) for
the n-r sets of values

$$\begin{array}{llll}
x_{r+1} = 1, & x_{r+1} = 0, & \ldots, & x_{r+1} = 0, \\
x_{r+2} = 0, & x_{r+2} = 1, & \ldots, & x_{r+2} = 0, \\
\ldots & & & \\
x_n = 0, & x_n = 0, & \ldots, & x_n = 1.
\end{array} \qquad (8)$$

We shall use X to denote the single-column matrix whose elements
are $x_1, \ldots, x_n$. The particular solutions of (6) obtained
from the sets of values (8) we denote by

$$X_k \quad (k = 1, 2, \ldots, n-r). \qquad (9)$$

As we shall prove in §6.2, the general solution X of (6), which
is obtained by giving $x_{r+1}, \ldots, x_n$ arbitrary values in (7), is
furnished by the formula

$$X = \sum_{k=1}^{n-r} x_{r+k}X_k. \qquad (10)$$

That is to say, (a) every solution of (6) is a sum of multiples
of the particular solutions $X_k$. Moreover, (b) no one $X_k$ is a sum
of multiples of the other $X_k$. A set of solutions that has the
properties (a) and (b) is called a fundamental set of solutions.

There is more than one fundamental set of solutions. If we
replace the numbers on the right of the equality signs in (8) by
the elements of any non-singular matrix of order n-r, we are
led to a fundamental set of solutions of (6). If we replace these
numbers by the elements of a singular matrix of order n-r,
we are led to n-r solutions of (6) that do not form a fundamental
set of solutions. This we shall prove in §6.4.
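The construction of §6.1 can be sketched in code. The version below is an illustration under stated assumptions: instead of relabelling the variables so that the first r columns carry a non-zero minor, it row-reduces A and treats the columns without a pivot as the variables that receive the values (8); the function names `fundamental_set` and `is_solution` and the sample matrices are hypothetical.

```python
from fractions import Fraction

def fundamental_set(A):
    """n - r solutions of Ax = 0, one for each choice of values in table (8)."""
    M = [[Fraction(x) for x in row] for row in A]
    m, n = len(M), len(M[0])
    pivots, rk = [], 0
    for c in range(n):
        p = next((r for r in range(rk, m) if M[r][c] != 0), None)
        if p is None:
            continue                      # column c belongs to a free variable
        M[rk], M[p] = M[p], M[rk]
        pivot = M[rk][c]
        M[rk] = [x / pivot for x in M[rk]]
        for r in range(m):
            if r != rk and M[r][c] != 0:
                f = M[r][c]
                M[r] = [x - f * y for x, y in zip(M[r], M[rk])]
        pivots.append(c)
        rk += 1
        if rk == m:
            break
    free = [c for c in range(n) if c not in pivots]
    solutions = []
    for fc in free:
        # One free variable is 1 and the others 0, as in one column of (8).
        x = [Fraction(0)] * n
        x[fc] = Fraction(1)
        for i, pc in enumerate(pivots):
            x[pc] = -M[i][fc]             # the pivot variables are then determined
        solutions.append(x)
    return solutions

def is_solution(A, x):
    """True when every equation of the homogeneous set is satisfied by x."""
    return all(sum(a * xi for a, xi in zip(row, x)) == 0 for row in A)

A = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]     # hypothetical matrix of rank 2, n = 3
basis = fundamental_set(A)
print(basis, all(is_solution(A, x) for x in basis))

wide = [[1, 1, 1, 1]]                     # one equation, four unknowns
print(len(fundamental_set(wide)))
```

For the first matrix the set contains n-r = 1 solution; for the single equation in four unknowns it contains three, in agreement with §6.3.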





6.2. Proof† of the statements made in §6.1. We are
concerned with a set of m homogeneous linear equations in n
unknowns when the matrix of the coefficients is of rank r and
r < n. As we have seen, every solution of such a set is a solution
of a set of equations that may be written in the form

$$\sum_{t=1}^{r} a_{it}x_t = -\sum_{t=r+1}^{n} a_{it}x_t \quad (i = 1, \ldots, r), \qquad (7)$$

wherein the determinant of the coefficients on the left-hand
side is not zero. We shall, therefore, no longer consider the
original equations but shall consider only (7).

As in §6.1, we use X to denote $x_1, \ldots, x_n$ considered as the
row elements of a matrix of one column, $X_k$ to denote the
particular solutions of equations (7) obtained from the sets of
values (8). Further, we use $l_k$ to denote the kth column of (8)
considered as a one-column matrix, that is, $l_k$ has unity in its
kth row and zero in the remaining n-r-1 rows. Let $A_1$ denote
the matrix of the coefficients on the left-hand side of equations
(7) and let $A_1^{-1}$ denote the reciprocal of $A_1$.
In one step of the proof we use X£+ s to denote x ai ..., x a + a 
considered as the elements of a one-column matrix. 

The general solution of (7) is obtained by giving arbitrary 
values to x r+1 ,..., x n and then solving the equations (uniquely, 
since A 1 is non-singular) for x v ..., x r . This solution may be 
symbolized, on using the device of sub-matrices, as 



But the last matrix is

$$\sum_{k=1}^{n-r} x_{r+k}X_k,$$

and so we have proved that every solution of (7) may be represented
by the formula

$$X = \sum_{k=1}^{n-r} x_{r+k}X_k. \qquad (10)$$

† Many readers will prefer to omit these proofs, at least on a first reading.
The remainder of §6 does not make easy reading for a beginner.

6.3. A fundamental set contains n-r solutions.

6.31. We first prove that a set of less than n-r solutions
cannot form a fundamental set.

Let $Y_1, \ldots, Y_p$, where p < n-r, be any p distinct solutions of
(7). Suppose, contrary to what we wish to prove, that the $Y_i$ form
a fundamental set. Then there are constants $\lambda_{ik}$ such that

$$X_k = \sum_{i=1}^{p} \lambda_{ik}Y_i \quad (k = 1, \ldots, n-r),$$

where the $X_k$ are the solutions of §6.1 (9). By Theorem 31
applied to the matrix $[\lambda_{ik}]$, whose rank cannot exceed p, we
can express every $X_k$ as a sum of multiples of p of them (at
most). But, as we see from the table of values (8), this is
impossible; and so our supposition is not a true one and the
$Y_i$ cannot form a fundamental set.

6.32. We next prove that in any set of more than n-r solutions
one solution (at least) is a sum of multiples of the remainder.

Let $Y_1, \ldots, Y_p$, where p > n-r, be any p distinct solutions of
(7). Then, by (10),

$$Y_i = \sum_{k=1}^{n-r} \mu_{ik}X_k \quad (i = 1, \ldots, p),$$

where $\mu_{ik}$ is the value of $x_{r+k}$ in $Y_i$. Exactly as in §6.31, we
can express every $Y_i$ as a sum of multiples of n-r of them
(at most).

6.33. It follows from the definition of a fundamental set and
from §§6.31 and 6.32 that a fundamental set must contain n-r
solutions.

6.4. Proof of statements made in §6.1 (continued).

6.41. If $[c_{ik}]$ is a non-singular matrix of order n-r, and if
$X'_i$ is the solution of (7) obtained by taking the values

$$x_{r+1} = c_{i1}, \quad x_{r+2} = c_{i2}, \quad \ldots, \quad x_n = c_{i,n-r},$$

then, by what we have proved in §6.2,

$$X'_i = \sum_{k=1}^{n-r} c_{ik}X_k \quad (i = 1, \ldots, n-r).$$

On solving these equations (Chap. VII, §3.1), we get

$$X_i = \sum_{k=1}^{n-r} \gamma_{ik}X'_k \quad (i = 1, \ldots, n-r), \qquad (11)$$

where $[\gamma_{ik}]$ is the reciprocal of $[c_{ik}]$.

Hence, by (10), every solution of (7) may be written in the
form

$$X = \sum_{i=1}^{n-r} x_{r+i} \sum_{k=1}^{n-r} \gamma_{ik}X'_k
= \sum_{k=1}^{n-r} \Bigl( \sum_{i=1}^{n-r} x_{r+i}\gamma_{ik} \Bigr) X'_k.$$

Moreover, it is not possible to express any one $X'_i$ as a sum
of multiples of the remainder. For if it were, (11) would give
equations of the form†

$$X_i = \sum_{k=1}^{n-r-1} \beta_{ik}X'_k \quad (i = 1, \ldots, n-r)$$

and these equations, by the argument of §6.31, would imply
that one $X_k$ could be expressed as a sum of multiples of the
others.

Hence the solutions $X'_i$ form a fundamental set.

† We have supposed the solutions to be labelled so that $X'_{n-r}$ can be written
as a sum of multiples of the rest.

6.42. If $[c_{ik}]$ is a singular matrix of order n-r, then, with
a suitable labelling of its elements, there are constants $\lambda_{\theta s}$ such
that

$$c_{p+\theta,k} = \sum_{s=1}^{p} \lambda_{\theta s}c_{sk} \quad (k = 1, \ldots, n-r),$$

where p is the rank of the matrix, and so p < n-r.

But, by (10),

$$X'_{p+\theta} = \sum_{k=1}^{n-r} c_{p+\theta,k}X_k \quad (\theta = 1, \ldots, n-r-p)$$

and

$$X'_s = \sum_{k=1}^{n-r} c_{sk}X_k \quad (s = 1, \ldots, p).$$

Hence there is an equation

$$X'_{p+\theta} = \sum_{s=1}^{p} \lambda_{\theta s}X'_s$$

and so the $X'_i$ cannot form a fundamental set of solutions.

6.5. A set of solutions such that no one can be expressed
as a sum of multiples of the others is said to be linearly
independent.

By what we have proved in §§6.41 and 6.42, a set of solutions
that is derived from a non-singular matrix $[c_{ik}]$ is linearly
independent and a set of solutions that is derived from a singular
matrix $[c_{ik}]$ is not linearly independent.

Accordingly, if a set of n-r solutions is linearly independent,
it must be derived from a non-singular matrix $[c_{ik}]$ and so, by
§6.41, such a set of solutions is a fundamental set.

Hence, any linearly independent set of n-r solutions is a
fundamental set.

Moreover, any set of n-r solutions that is not linearly independent
is not a fundamental set, for it must be derived from
a singular matrix $[c_{ik}]$.


Examples IX 

Devices to reduce the calculations when finding the rank of a 
matrix 

1. Prove that the rank of a matrix is unaltered by any of the following 
changes in the elements of the matrix: 

(i) the interchange of two rows (columns), 

(ii) the multiplication of a row (column) by a non-zero constant, 

(iii) the addition of any two rows. 

Hint. Finding the rank depends upon showing which minors of the 
matrix are non-zero determinants. Or, more compactly, use Examples 
VII, 18-20, and Theorem 34 of Chapter IX. 

2. When every minor of $[a_{ik}]$ of order r+1 that contains the first
r rows and r columns is zero, prove that there are constants $\lambda_{ik}$ such
that the kth row of $[a_{ik}]$ is given by

$$\rho_k = \sum_{i=1}^{r} \lambda_{ik}\rho_i.$$

Prove (by the method of §3.2, or otherwise) that every minor of $[a_{ik}]$
of order r+1 is then zero.

3. By using Example 2, prove that if a minor of order r is non-zero
and if every minor obtained by bordering it with the elements from an
arbitrary column and an arbitrary row is zero, then the rank of the
matrix is r.


4. Rule for symmetric square matrices. If a principal minor of
order r is non-zero and all principal minors of order r+1 are zero, then
the rank of the matrix is r. (See p. 108.)

Prove the rule by establishing the following results for $[a_{ik}]$.

(i) If $A_{ik}$ denotes the co-factor of $a_{ik}$ in

$$C = \begin{vmatrix}
a_{11} & \ldots & a_{1r} & a_{1s} & a_{1t} \\
\vdots & & \vdots & \vdots & \vdots \\
a_{r1} & \ldots & a_{rr} & a_{rs} & a_{rt} \\
a_{s1} & \ldots & a_{sr} & a_{ss} & a_{st} \\
a_{t1} & \ldots & a_{tr} & a_{ts} & a_{tt}
\end{vmatrix} \quad (a_{ik} = a_{ki}),$$

then $A_{ss}A_{tt} - A_{ts}A_{st} = MC$, where M is the complementary minor of
$a_{ss}a_{tt} - a_{ts}a_{st}$ in C (Theorem 18).

(ii) If every C = 0 and every corresponding $A_{ss} = 0$, then every
$A_{st} = A_{ts} = 0$ and (Example 2) every minor of the complete matrix
$[a_{ik}]$ of order r+1 is zero.

(iii) If all principal minors of orders r+1 and r+2 are zero, then the
rank is r or less. (May be proved by induction.)


Numerical examples 

5. Find the ranks of the matrices 


(i) 

1 

3 

4 

T 

. ‘ (i») 

’4 

5 

61, 


2 

4 

5 

8 


6 

6 

7 


3 

1 

2 

3. 

H 

.7 

8 

9. 


(iii) 


2 

1 

3 

4' 

. (iv) 

' 6 

1 

3 

8 

5 

8 

1 

4 


4 

2 

6 

-1 

6 

5 

8 

1 


10 

3 

9 

7 

3 

8 

7 

2 

““ 

16 

4 

12 

15 


Ans. (i) 3, (ii) 2, (iii) 4, (iv) 2.

Geometrical applications 

6/ ^he homogeneous coordinates of a point in a plane are (#, y,z)> 
Prove that the three points (x tt y r , av), r *= 1, 2, 3, are collinear or non- 

+oollinear according as the rank of the matrix 

■aft*'- - ■ * 


iaSor 8 


[ *i Vi h' 
x t y t 

*i V» *». 
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7. The -coordinates of an arbitrary point can be written (with the 
notation of Example 6) in the form 

. 3 " = 

V = At/i-f w % +vy» 
z = vz a , 

provided that the points 1, 2, 3 are not collinear. 

Hint. Consider the matrix of 4 rows and 3 columns and use 
Theorem 31. 

8. Give the analogues of Examples 0 and 7 for the homogeneous 
coordinates (x, y t z, w) of a point in three-dimensional space. 

9. The configuration of three planes in space, whose cartesian equa- 
tions are 

a 4l x+a ia 2/+a 43 z = 6 4 (i = 1, 2, 3), 

is determined by the ranks of the matrices A == [a ik ] and the augmented 
matrix B (§4.1). Check the following results. 

(i) When |a ik | ^ 0, the rank of A is 3 and the three planes meet in 
a finite point. 

(ii) When the rank of A is 2 and the rank of B is 3, the planes have 
no finite point of intersection. 

The planes are all parallel only if every A ik is zero, when the rank 
of A is 1. When the rank of A is 2, 

either (o) no two planes are parallel, 

or (6) two are parallel and the third not. 

Since |o tt [ = 0, 

-^11 : -^13 : -^13 — ^31 : -^28 : -^23 = -^31 ; *^32 ' -^38 

(Theorem 12), and if the planes 2 and 3 intersect, their line of inter- 
section has direction cosines proportional to A llf A 1V A lr Hence in 
(a) the planes meet in three parallel lines and in (6) the third plane, 
meets the tWo parallel planes in a pair of parallel lines. 

The configuration is three planes forming a triangular prism; as a 
special case, two faces of the prism may be parallel. 

(iii) When the rank of A is 2 and the rank of B is 2, one equation 
is a sum of multiples of the other two. The planes given by these two 
equations are not parallel (or the rank of A would be 1), and so the 
configuration is three planes having a common line of intersection. 

(iv) When the rank of A is 1 and the rank of B is 2, the three planes
are parallel.

(v) When the rank of A is 1 and the rank of B is 1, all three
equations represent one and the same plane.
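
The five cases of Example 9 can be checked mechanically. The following Python sketch is an illustration only: the names `rank` and `classify_planes` are invented here, exact rational arithmetic is assumed so that rounding cannot disturb a rank, and every equation is assumed to be a genuine plane. It computes the ranks of A and of the augmented matrix B and looks up the corresponding configuration.

```python
from fractions import Fraction


def rank(rows):
    """Rank of a matrix (a list of rows), by elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in rows]
    r = 0  # number of pivot rows found so far
    for col in range(len(m[0])):
        pivot = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            factor = m[i][col] / m[r][col]
            m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
        if r == len(m):
            break
    return r


def classify_planes(a, b):
    """a: 3x3 coefficients a_ik; b: the three right-hand sides b_i.

    Assumption: no row of a consists entirely of zeros.
    """
    augmented = [list(row) + [rhs] for row, rhs in zip(a, b)]
    cases = {
        (3, 3): "(i) one finite point",
        (2, 3): "(ii) triangular prism, no finite common point",
        (2, 2): "(iii) common line",
        (1, 2): "(iv) parallel planes",
        (1, 1): "(v) one and the same plane",
    }
    return cases[(rank(a), rank(augmented))]


# x = 0, y = 0, x + y = 1: rank of A is 2, rank of B is 3.
print(classify_planes([[1, 0, 0], [0, 1, 0], [1, 1, 0]], [0, 0, 1]))
```

Changing the right-hand sides to 0, 0, 0 in the last call lowers the rank of B to 2 and moves the same three planes into case (iii).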
10. Prove that the rectangular cartesian coordinates (X_1, X_2, X_3) of
the orthogonal projection of (x_1, x_2, x_3) on a line through the origin having
direction cosines (l_1, l_2, l_3) are given by

\[ X = \begin{bmatrix} l_1^2 & l_1 l_2 & l_1 l_3 \\ l_1 l_2 & l_2^2 & l_2 l_3 \\ l_1 l_3 & l_2 l_3 & l_3^2 \end{bmatrix} x, \]

where X, x are single column matrices.

11. If L denotes the matrix in Example 10, and M denotes a
corresponding m matrix, show that

\[ L^2 = L \quad \text{and} \quad M^2 = M, \]

and that the point (L+M)x is the projection of x on a line in the plane
of the lines l and m if and only if l and m are perpendicular.

Note. A ‘principal minor’ is obtained by deleting corresponding rows 
and columns. 



CHAPTER IX 

DETERMINANTS ASSOCIATED WITH MATRICES

1. The rank of a product of matrices 
1.1. Suppose that A is a matrix of n columns, and B a matrix
of n rows, so that the product AB can be formed. When A has
n_1 rows and B has n_2 columns, the product AB has n_1 rows
and n_2 columns.

If a_ik, b_ik are typical elements of A, B,

\[ c_{ik} = \sum_{j=1}^{n} a_{ij} b_{jk} \]

is a typical element of AB.

Any t-rowed minor of AB may, with a suitable labelling of
the elements, be denoted by

\[ \Delta = \begin{vmatrix} c_{11} & c_{12} & \cdots & c_{1t} \\ c_{21} & c_{22} & \cdots & c_{2t} \\ \vdots & \vdots & & \vdots \\ c_{t1} & c_{t2} & \cdots & c_{tt} \end{vmatrix}. \]

When t = n (this assumes that n_1 ≥ n, n_2 ≥ n), Δ is the
product by rows of the two determinants

\[ \begin{vmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{n1} & \cdots & a_{nn} \end{vmatrix}, \qquad
   \begin{vmatrix} b_{11} & \cdots & b_{n1} \\ \vdots & & \vdots \\ b_{1n} & \cdots & b_{nn} \end{vmatrix}; \]

that is, Δ is the product of a t-rowed minor of A by a t-rowed
minor of B.

When t ≠ n, Δ is the product by rows of the two arrays

\[ \begin{matrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{t1} & \cdots & a_{tn} \end{matrix} \qquad\qquad
   \begin{matrix} b_{11} & \cdots & b_{n1} \\ \vdots & & \vdots \\ b_{1t} & \cdots & b_{nt} \end{matrix} \]

Hence (Theorem 15), when t > n, Δ is zero; and when t < n,
Δ is the sum of all the products of corresponding determinants
of order t that can be formed from the two arrays (Theorem 14).

Hence every minor of AB of order greater than n is zero, and
every minor of order t ≤ n is the product, or the sum of products,
of a t-rowed minor of A by a t-rowed minor of B.

Accordingly,

Theorem 33. The rank of a product AB cannot exceed the 
rank of either factor. 

For, by what we have shown, (i) every minor of AB with
more than n rows is zero, and the ranks of A and B cannot
exceed n, (ii) every minor of AB of order t ≤ n is the product
or the sum of products of minors of A and B of order t. If all
the minors of A or if all the minors of B of order r are zero,
then so are all minors of AB of order r.

1.2. There is one type of multiplication which gives a theorem
of a more precise character.

Theorem 34. If A has m rows and n columns and B is a non-
singular square matrix of order n, then A and AB have the same
rank; and if C is a non-singular square matrix of order m, then
A and CA have the same rank.

If the rank of A is r and the rank of AB is ρ, then ρ ≤ r (Theorem 33).
But A = AB.B^{-1} and so r, the rank of A, cannot exceed ρ and
n, which are the ranks of the factors AB and B^{-1}. Hence ρ = r.
The proof for CA is similar.
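
Theorems 33 and 34 can be illustrated numerically. The sketch below is not part of the text's argument; the helper names `rank` and `matmul` and the three small matrices are chosen here purely for illustration. Multiplying by a singular matrix may lower the rank, while multiplying by a non-singular one leaves it unchanged.

```python
from fractions import Fraction


def rank(rows):
    """Rank of a matrix (a list of rows), by elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in rows]
    r = 0
    for col in range(len(m[0])):
        pivot = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            factor = m[i][col] / m[r][col]
            m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
        if r == len(m):
            break
    return r


def matmul(a, b):
    """Matrix product of a (p x n) and b (n x q), both given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


A = [[1, 0, 0], [0, 0, 1]]                      # rank 2
B_singular = [[1, 0, 0], [0, 1, 0], [0, 0, 0]]  # rank 2, determinant 0
B_regular = [[1, 1, 0], [0, 1, 1], [0, 0, 1]]   # determinant 1

rank_with_singular = rank(matmul(A, B_singular))  # Theorem 33: at most 2
rank_with_regular = rank(matmul(A, B_regular))    # Theorem 34: equal to rank(A)
print(rank(A), rank_with_singular, rank_with_regular)
```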

2. The characteristic equation; latent roots 
2.1. Associated with every square matrix A of order n is the
matrix A − λI, where I is the unit matrix of order n and λ is
any number, real or complex. In full, this matrix is

\[ \begin{bmatrix} a_{11}-\lambda & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22}-\lambda & \cdots & a_{2n} \\ \vdots & \vdots & & \vdots \\ a_{n1} & a_{n2} & \cdots & a_{nn}-\lambda \end{bmatrix}. \]

The determinant of this matrix is of the form

\[ f(\lambda) = (-1)^n (\lambda^n + p_1 \lambda^{n-1} + \dots + p_n), \tag{1} \]

where the p_r are polynomials in the n² elements a_ik. The roots
of the equation f(λ) = 0, that is, of

\[ |A - \lambda I| = 0, \tag{2} \]

are called the latent roots of the matrix A and the equation
itself is called the characteristic equation of A.

2.2. Theorem 35. Every square matrix satisfies its own
characteristic equation; that is, if

\[ |A - \lambda I| = (-1)^n (\lambda^n + p_1 \lambda^{n-1} + \dots + p_n), \]

then

\[ A^n + p_1 A^{n-1} + \dots + p_{n-1} A + p_n I = 0. \]

The adjoint matrix of A − λI, say B, is a matrix whose
elements are polynomials in λ of degree n−1 or less, the
coefficients of the various powers of λ being polynomials in the a_ik.
Such a matrix can be written as

\[ B_0 + B_1 \lambda + \dots + B_{n-2} \lambda^{n-2} + B_{n-1} \lambda^{n-1}, \tag{3} \]

where the B_r are matrices whose elements are polynomials in
the a_ik.

Now, by Theorem 24, the product of a matrix by its adjoint
= determinant of the matrix × unit matrix. Hence

\[ (A - \lambda I)B = |A - \lambda I| \times I = (-1)^n (\lambda^n + p_1 \lambda^{n-1} + \dots + p_n) I, \]

on using the notation of (1). Since B is given by (3), we have

\[ (A - \lambda I)(B_0 + B_1 \lambda + \dots + B_{n-2}\lambda^{n-2} + B_{n-1}\lambda^{n-1})
   = (-1)^n (\lambda^n + p_1 \lambda^{n-1} + \dots + p_n) I. \]

This equation is true for all values of λ and we may therefore
equate coefficients† of λ; this gives us

\[ \begin{aligned}
-B_{n-1} &= (-1)^n I \qquad (IB_{n-1} = B_{n-1}), \\
-B_{n-2} + AB_{n-1} &= (-1)^n p_1 I, \\
-B_{n-3} + AB_{n-2} &= (-1)^n p_2 I, \\
&\;\;\vdots \\
-B_0 + AB_1 &= (-1)^n p_{n-1} I, \\
AB_0 &= (-1)^n p_n I.
\end{aligned} \]

† We may regard the equation as the conspectus of the n² equations

\[ (a_{ik} - \lambda \delta_{ik})(b_{kj}^{(0)} + \dots + b_{kj}^{(n-1)} \lambda^{n-1})
   = (-1)^n (\lambda^n + p_1 \lambda^{n-1} + \dots + p_n)\, \delta_{ij}, \]

so that the equating of coefficients is really the ordinary algebraical procedure
of equating coefficients, but doing it n² times for each power of λ.

Hence we have

\[ \begin{aligned}
A^n + p_1 A^{n-1} + \dots + p_{n-1} A + p_n I
 &= A^n I + A^{n-1} p_1 I + \dots + A\, p_{n-1} I + p_n I \\
 &= (-1)^n \{ -A^n B_{n-1} + A^{n-1}(-B_{n-2} + AB_{n-1}) + \dots + A(-B_0 + AB_1) + AB_0 \} \\
 &= 0.
\end{aligned} \]
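
Theorem 35 can be tested on a particular matrix. The sketch below is an illustration under stated assumptions: the coefficients p_1, ..., p_n are obtained by the Faddeev-LeVerrier trace recurrence, a method that is not used in the text, and the names `char_coeffs` and `poly_at_matrix` as well as the 3 × 3 matrix are invented for the example. The polynomial is then evaluated at A and the result compared with the zero matrix.

```python
from fractions import Fraction


def matmul(a, b):
    """Matrix product of two square matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def add_scalar(m, s):
    """Return m + s*I."""
    return [[v + (s if i == j else 0) for j, v in enumerate(row)] for i, row in enumerate(m)]


def char_coeffs(a):
    """Return [p_1, ..., p_n], where
    det(lambda*I - A) = lambda^n + p_1*lambda^(n-1) + ... + p_n
    (the bracket in formula (1) of the text), by the Faddeev-LeVerrier recurrence."""
    n = len(a)
    a = [[Fraction(v) for v in row] for row in a]
    m = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]  # M_1 = I
    p = []
    for k in range(1, n + 1):
        am = matmul(a, m)
        pk = -sum(am[i][i] for i in range(n)) / k   # p_k = -trace(A M_k) / k
        p.append(pk)
        m = add_scalar(am, pk)                      # M_{k+1} = A M_k + p_k I
    return p


def poly_at_matrix(p, a):
    """Evaluate A^n + p_1 A^(n-1) + ... + p_n I by Horner's rule."""
    n = len(a)
    result = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for pk in p:
        result = add_scalar(matmul(a, result), pk)
    return result


A = [[2, 1, 0], [1, 3, 1], [0, 1, 4]]
p = char_coeffs(A)
residual = poly_at_matrix(p, A)
print([int(c) for c in p])
print(all(v == 0 for row in residual for v in row))
```

For this matrix the coefficients come out as −9, 24, −18, that is, minus the trace, the sum of the two-rowed principal minors, and minus the determinant.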

2.3. If λ_1, λ_2, ..., λ_n are the latent roots of the matrix A, so
that

\[ |A - \lambda I| = (-1)^n (\lambda - \lambda_1)(\lambda - \lambda_2)\dots(\lambda - \lambda_n)
   = (-1)^n (\lambda^n + p_1 \lambda^{n-1} + \dots + p_n), \]

it follows that (Example 13, p. 81),

\[ (A - \lambda_1 I)(A - \lambda_2 I)\dots(A - \lambda_n I) = A^n + p_1 A^{n-1} + \dots + p_n I = 0. \]

It does not follow that any one of the matrices A − λ_r I is the
zero matrix.

3. A theorem on determinants and latent roots 

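3.1. Theorem 36. If g(t) is a polynomial in t, and A is a
square matrix, then the determinant of the matrix g(A) is equal to
the product

\[ g(\lambda_1)\, g(\lambda_2) \dots g(\lambda_n), \]

where λ_1, λ_2, ..., λ_n are the latent roots of A.

Let

\[ g(t) = c(t - t_1)\dots(t - t_m), \]

so that

\[ g(A) = c(A - t_1 I)\dots(A - t_m I). \]

Then, by Chapter VI, § 10,

\[ \begin{aligned}
|g(A)| &= c^n |A - t_1 I| \times \dots \times |A - t_m I| \\
 &= (-1)^{mn} c^n \prod_{r=1}^{n} (t_1 - \lambda_r) \dots \prod_{r=1}^{n} (t_m - \lambda_r) \\
 &= c^n \prod_{r=1}^{n} (\lambda_r - t_1) \dots \prod_{r=1}^{n} (\lambda_r - t_m) \\
 &= \prod_{r=1}^{n} g(\lambda_r).
\end{aligned} \]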

The theorem also holds when g(A) is a rational function
g_1(A)/g_2(A) provided that g_2(A) is non-singular.

3.2. Further, let g_1(A) and g_2(A) be two polynomials in the
matrix A, g_2(A) being non-singular. Then the matrix

\[ \lambda I - \{ g_1(A)/g_2(A) \} \]

is a rational function of A with a non-singular denominator
g_2(A). Applying the theorem to this function we obtain, on
writing g_1(A)/g_2(A) = g(A),

\[ |\lambda I - g(A)| = \prod_{r=1}^{n} \{ \lambda - g(\lambda_r) \}. \]

From this follows an important result: 

Theorem 36. Corollary. If λ_1, ..., λ_n are the latent roots of
the matrix A and g(A) is of the form g_1(A)/g_2(A), where g_1, g_2 are
polynomials and g_2(A) is non-singular, the latent roots of the
matrix g(A) are given by g(λ_r).
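
A small numerical check of Theorem 36 and its corollary may be helpful. The matrix, the polynomial g(t) = t² + 1 and the helper names below are assumptions made for this illustration; the latent roots 1 and 3 are those of |A − λI| = λ² − 4λ + 3.

```python
def matmul(a, b):
    """Matrix product of two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def det2(m):
    """Determinant of a 2 x 2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def g(t):
    """The scalar polynomial g(t) = t^2 + 1."""
    return t * t + 1


def g_of_matrix(a):
    """The matrix polynomial g(A) = A^2 + I for a 2 x 2 matrix."""
    square = matmul(a, a)
    return [[square[i][j] + (1 if i == j else 0) for j in range(2)] for i in range(2)]


A = [[2, 1], [1, 2]]
latent_roots = [1, 3]

G = g_of_matrix(A)
determinant_of_g = det2(G)                                  # |g(A)|
product_of_values = g(latent_roots[0]) * g(latent_roots[1])  # g(1) g(3)

# Corollary: each g(lambda_r) should be a latent root of g(A),
# i.e. |g(A) - mu I| should vanish for mu = g(lambda_r).
shifted_dets = [
    det2([[G[i][j] - (g(root) if i == j else 0) for j in range(2)] for i in range(2)])
    for root in latent_roots
]
print(determinant_of_g, product_of_values, shifted_dets)
```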

4. Equivalent matrices 

4.1. Elementary transformations of a matrix to stan- 
dard form. 

Definition 18. The elementary transformations of a 
matrix are 

(i) the interchange of two rows or columns , 

(ii) the multiplication of each element of a row (or column) by 
a constant other than zero , 

(iii) the addition to the elements of one row (or column) of a 
constant multiple of the elements of another row (or column). 

Let A be a matrix of rank r. Then, as we shall prove, it can
be changed by elementary transformations into a matrix of the
form

\[ \begin{bmatrix} I & 0 \\ 0 & 0 \end{bmatrix}, \]

where I denotes the unit matrix of order r and the zeros denote
null matrices, in general rectangular.

In the first place,† elementary transformations of type (i)
replace A by a matrix such that the minor formed by the
elements common to its first r rows and r columns is not equal
to zero. Next, we can express any other row of A as a sum of
multiples of these r rows; subtraction of these multiples of the
r rows from the row in question will therefore give a row of
zeros. These transformations, of type (iii), leave the rank of the
matrix unaltered (cf. Examples IX, 1) and the matrix now
before us is of the form

\[ \begin{bmatrix} P & Q \\ 0 & 0 \end{bmatrix}, \]

where P denotes a matrix, of r rows and columns, such that
|P| ≠ 0, Q stands for the remaining elements of the first r rows,
and the zeros denote null matrices.

† The argument that follows is based on Chapter VIII, § 3.

By working with columns where before we worked with rows,
this can be transformed by elementary transformations, of type
(iii), to

\[ \begin{bmatrix} P & 0 \\ 0 & 0 \end{bmatrix}. \]

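Finally, suppose P is, in full,

\[ \begin{bmatrix} a_1 & b_1 & c_1 & \cdots & k_1 \\ a_2 & b_2 & c_2 & \cdots & k_2 \\ \vdots & \vdots & \vdots & & \vdots \\ a_r & b_r & c_r & \cdots & k_r \end{bmatrix}. \]

Then, again by elementary transformations of types (i) and
(iii), P can be changed successively to

\[ \begin{bmatrix} a_1 & 0 & 0 & \cdots & 0 \\ a_2 & \beta_2 & \gamma_2 & \cdots & \kappa_2 \\ \vdots & \vdots & \vdots & & \vdots \\ a_r & \beta_r & \gamma_r & \cdots & \kappa_r \end{bmatrix}, \qquad
   \begin{bmatrix} a_1 & 0 & 0 & \cdots & 0 \\ 0 & \beta_2 & 0 & \cdots & 0 \\ \vdots & \vdots & \vdots & & \vdots \\ 0 & \beta_r & \gamma'_r & \cdots & \kappa'_r \end{bmatrix}, \]

and so, step by step, to a matrix having zeros in all places other
than the principal diagonal and non-zeros† in that diagonal.

A final series of elementary transformations, of type (ii),
presents us with I, the unit matrix of order r.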

† If a diagonal element were zero the rank of P would be less than r;
these transformations leave the rank unaltered.

4.2. As Examples VII, 18-20, show, the transformations
envisaged in § 4.1 can be performed by pre- and post-multiplication
of A by non-singular matrices. Hence we have proved that
when A is a matrix of rank r, there are non-singular matrices
B and C such that

\[ BAC = I'_r, \]

where I'_r denotes the unit matrix of order r bordered by null
matrices.
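
The reduction of § 4.1 can be carried out mechanically while recording the row operations in B and the column operations in C. The sketch below is one possible arrangement and not the text's own procedure: it takes the three types of transformation in a slightly different order (interchange, scale, then clear), the function name `normal_form` is invented here, and exact rational arithmetic is assumed.

```python
from fractions import Fraction


def identity(n):
    return [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]


def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def normal_form(a):
    """Reduce a to I'_r by elementary transformations.

    Returns (b, c, m, r) with m = b*a*c, where m is the unit matrix of
    order r bordered by zeros; b and c are products of elementary matrices.
    """
    m = [[Fraction(v) for v in row] for row in a]
    rows, cols = len(m), len(m[0])
    b, c = identity(rows), identity(cols)
    r = 0
    while r < min(rows, cols):
        pivot = next(((i, j) for i in range(r, rows)
                      for j in range(r, cols) if m[i][j] != 0), None)
        if pivot is None:
            break
        pi, pj = pivot
        # Type (i): bring the non-zero element to place (r, r).
        m[r], m[pi] = m[pi], m[r]
        b[r], b[pi] = b[pi], b[r]
        for mat in (m, c):
            for row in mat:
                row[r], row[pj] = row[pj], row[r]
        # Type (ii): make the diagonal element equal to 1.
        scale = m[r][r]
        m[r] = [v / scale for v in m[r]]
        b[r] = [v / scale for v in b[r]]
        # Type (iii) on rows: clear the rest of column r.
        for i in range(r + 1, rows):
            f = m[i][r]
            if f != 0:
                m[i] = [x - f * y for x, y in zip(m[i], m[r])]
                b[i] = [x - f * y for x, y in zip(b[i], b[r])]
        # Type (iii) on columns: clear the rest of row r.
        for j in range(r + 1, cols):
            f = m[r][j]
            if f != 0:
                for mat in (m, c):
                    for row in mat:
                        row[j] -= f * row[r]
        r += 1
    return b, c, m, r


A = [[1, 2, 3], [2, 4, 6], [1, 0, 1]]
B, C, N, r = normal_form(A)
print(r, N == matmul(matmul(B, A), C))
```

Since every row operation is applied to B as well as to the working matrix, and every column operation to C, the relation N = BAC holds at each stage and therefore at the end.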


4.3. Definition 19. Two matrices are equivalent if it is
possible to pass from one to the other by a chain of elementary
transformations.

If A_1 and A_2 are equivalent, they have the same rank and
there are non-singular matrices B_1, B_2, C_1, C_2 such that

\[ B_1 A_1 C_1 = I'_r = B_2 A_2 C_2. \]

From this it follows that

\[ A_1 = B_1^{-1} B_2 A_2 C_2 C_1^{-1} = L A_2 M, \]

say, where L = B_1^{-1} B_2 and so is non-singular, and M = C_2 C_1^{-1}
and is also non-singular.

Accordingly, when two matrices are equivalent each can be 
obtained from the other through pre- and post-multiplication 
by non-singular matrices. 

Conversely, if A_2 is of rank r, L and M are non-singular
matrices, and

\[ A_1 = L A_2 M, \]

then, as we shall prove, we can pass from A_2 to A_1, or from
A_1 to A_2, by elementary transformations. Both A_2 and A_1 are
of rank r (Theorem 34). We can, as we saw in § 4.1, pass from
A_2 to I'_r by elementary transformations; we can pass from A_1
to I'_r by elementary transformations and so, by using the inverse
operations, we can pass from I'_r to A_1 by elementary
transformations. Hence we can pass from A_2 to A_1, or from A_1 to
A_2, by elementary transformations.

4.4. The detailed study of equivalent matrices is of fundamental
importance in the more advanced theory. Here we
have done no more than outline some of the immediate
consequences of the definition.†

† The reader who intends to pursue the subject seriously should consult
H. W. Turnbull and A. C. Aitken, An Introduction to the Theory of Canonical
Matrices (1932).




PART III 


LINEAR AND QUADRATIC FORMS 
INVARIANTS AND COVARIANTS 




CHAPTER X 

ALGEBRAIC FORMS: LINEAR TRANSFORMATIONS 

1. Number fields 

We recall the definition of a number field given in the pre- 
liminary note. 

Definition 1. A set of numbers, real or complex, is called
a field of numbers, or a number field, when, if r and s belong
to the set and s is not zero,

r + s, r − s, r × s, r ÷ s

also belong to the set.

Typical examples of number fields are

(i) the field of all rational real numbers (𝔉_r, say);

(ii) the field of all real numbers;

(iii) the field of all numbers of the form a + b√5, where a and
b are rational real numbers;

(iv) the field of all complex numbers.

Every number field must contain each and every number
that is contained in 𝔉_r (example (i) above); it must contain 1,
since it contains the quotient a/a, where a is any number of
the set other than zero; it must contain 0, 2, and every integer, since it contains
the difference 1−1, the sum 1+1, and so on; it must contain
every fraction, since it contains the quotient of any one integer
by another.

2. Linear and quadratic forms 

2.1. Let 𝔉 be any field of numbers and let a_ij, b_ij, ... be
determinate numbers of the field; that is, we suppose their values
to be fixed. Let x_r, y_r, ... denote numbers that are not to be
thought of as fixed, but as free to be any, arbitrary, numbers
from a field 𝔉_1, not necessarily the same as 𝔉. The numbers
a_ij, b_ij, ... we call constants in 𝔉; the symbols x_r, y_r, ... we call
variables in 𝔉_1.

An expression such as

\[ \sum_{j=1}^{n} a_{ij} x_j \tag{1} \]

is said to be a linear form in the variables x_j; an expression
such as

\[ \sum_{i=1}^{m} \sum_{j=1}^{n} a_{ij} x_i y_j \tag{2} \]

is said to be a bilinear form in the variables x_i and y_j; an
expression such as

\[ \sum_{i=1}^{n} \sum_{j=1}^{n} a_{ij} x_i x_j \qquad (a_{ij} = a_{ji}) \tag{3} \]

is said to be a quadratic form in the variables x_i. In each
case, when we wish to stress the fact that the constants a_ij
belong to a certain field, 𝔉 say, we refer to the form as one
with coefficients in 𝔉.

A form in which the variables are necessarily real numbers 
is said to be a ‘form in real variables’; one in which both 
coefficients and variables are necessarily real numbers is said 
to be a ‘real form’. 


2.2. It should be noticed that the term ‘quadratic form’ is
used of (3) only when a_ij = a_ji. This restriction of usage is
dictated by experience, which shows that the consequent theory
is more compact when such a restriction is imposed.

The restriction is, however, more apparent than real: for an
expression such as

\[ \sum_{i=1}^{n} \sum_{j=1}^{n} b_{ij} x_i x_j, \]

wherein b_ij is not always the same as b_ji, is identical with the
quadratic form

\[ \sum_{i=1}^{n} \sum_{j=1}^{n} a_{ij} x_i x_j \qquad (a_{ij} = a_{ji}) \]

when we define the a's by means of the equations

\[ a_{ii} = b_{ii}, \qquad a_{ij} = a_{ji} = \tfrac{1}{2}(b_{ij} + b_{ji}). \]

For example, x² + 3xy + 2y² is a quadratic form in the two
variables x and y, having coefficients

\[ a_{11} = 1, \qquad a_{22} = 2, \qquad a_{12} = a_{21} = \tfrac{3}{2}. \]
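
The passage from the b's to the a's in § 2.2 is easily carried out by machine. The sketch below is only an illustration; the names `symmetrise` and `form_value` are invented here, and the unsymmetrical array b is one of many ways of writing x² + 3xy + 2y² as a double sum.

```python
from fractions import Fraction


def symmetrise(b):
    """Return the a's of section 2.2: a_ij = a_ji = (b_ij + b_ji) / 2."""
    n = len(b)
    return [[Fraction(b[i][j] + b[j][i], 2) for j in range(n)] for i in range(n)]


def form_value(coeffs, x):
    """Value of the double sum  sum_i sum_j coeffs[i][j] * x[i] * x[j]."""
    n = len(x)
    return sum(coeffs[i][j] * x[i] * x[j] for i in range(n) for j in range(n))


b = [[1, 3], [0, 2]]   # x^2 + 3xy + 2y^2 written with b_12 = 3, b_21 = 0
a = symmetrise(b)      # a_11 = 1, a_12 = a_21 = 3/2, a_22 = 2

point = (2, 5)
print(form_value(b, point), form_value(a, point))
```

Both calls give the same value at every point, since x_i x_j = x_j x_i; only the way the coefficient of each product is shared between b_ij and b_ji has changed.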

2.3. Matrices associated with linear and quadratic
forms. The symmetrical square matrix A = [a_ij], having a_ij
in its ith row and jth column, is associated with the quadratic
form (3): it is symmetrical because a_ij = a_ji, and so A = A',
the transpose of A.

In the sequel it will frequently be necessary to bear in mind
that the matrix associated with any quadratic form is a
symmetrical matrix.

We also associate with the bilinear form (2) the matrix [a_ij]
of m rows and n columns; and with the linear form (1) we
associate the single-row matrix [a_i1, ..., a_in]. More generally,
we associate with the m linear forms

\[ \sum_{j=1}^{n} a_{ij} x_j \qquad (i = 1, \dots, m) \tag{4} \]

the matrix [a_ij] of m rows and n columns.

2.4. Notation. We denote the associated matrix [a_ij] of any
one of (2), (3), or (4) by the single letter A. We may then
conveniently abbreviate

the bilinear form (2) to A(x, y),
the quadratic form (3) to A(x, x),
and the m linear forms (4) to Ax.

The first two of these are merely shorthand notations; the third,
though it also can be so regarded, is better envisaged as the
product of the matrix A by the matrix x, a single-column matrix
having x_1, ..., x_n as elements: the matrix product Ax, which has
as many rows as A and as many columns as x, is then a single-
column matrix of m rows having the m linear forms as its
elements.

2.5. Matrix expressions for quadratic and bilinear
forms. As in § 2.4, let x denote a single-column matrix with
elements x_1, ..., x_n: then x', the transpose of x, is a single-row
matrix with elements x_1, ..., x_n.

Let A denote the matrix of the quadratic form \sum \sum a_{rs} x_r x_s.
Then Ax is the single-column matrix having

\[ a_{r1} x_1 + \dots + a_{rn} x_n \]

as the element in its rth row, and x'Ax is a matrix of one row
(the number of rows of x') and one column (the number of
columns of Ax), the single element being

\[ \sum_{r=1}^{n} x_r (a_{r1} x_1 + \dots + a_{rn} x_n) = \sum_{r=1}^{n} \sum_{s=1}^{n} a_{rs} x_r x_s. \]

Thus the quadratic form is represented by the single-element
matrix x'Ax.

Similarly, when x and y are single-column matrices having
x_1, ..., x_n and y_1, ..., y_n as row elements, the bilinear form (2) is
represented by the single-element matrix x'Ay.

2.6. Definition 2. The discriminant of the quadratic form

\[ \sum_{i=1}^{n} \sum_{j=1}^{n} a_{ij} x_i x_j \qquad (a_{ij} = a_{ji}) \]

is the determinant of its coefficients, namely |a_ij|.

3. Linear transformations 
3.1. The set of equations

\[ X_i = \sum_{j=1}^{n} a_{ij} x_j \qquad (i = 1, \dots, n), \tag{1} \]

wherein the a_ij are given constants and the x_j are variables, is
said to be a linear transformation connecting the variables x_j and
the variables X_i. When the a_ij are constants in a given field 𝔉
we say that the transformation has coefficients in 𝔉; when the
a_ij are real numbers we say that the transformation is real.

Definition 3. The determinant |a_ij|, whose elements are the
coefficients of the transformation (1), is called the MODULUS OF
THE TRANSFORMATION.

Definition 4. A transformation is said to be non-singular
when its modulus is not zero, and is said to be singular when
its modulus is zero.

We sometimes speak of (1) as a transformation from the
x_i to the X_i; or, briefly, x → X.

3.2. The transformation (1) is most conveniently written as

\[ X = Ax, \tag{2} \]

a matrix equation in which X and x denote single-column
matrices with elements X_1, ..., X_n and x_1, ..., x_n respectively,
A denotes the matrix (a_ij), and Ax denotes the product of the
two matrices A and x.

When A is a non-singular matrix, it has a reciprocal A^{-1}
(Chap. VII, § 3) and

\[ A^{-1} X = A^{-1} A x = x, \tag{3} \]

which expresses x directly in terms of X. Also (A^{-1})^{-1} = A,
and so, with a non-singular transformation, it is immaterial
whether it be given in the form x → X or in the form X → x.
Moreover, when X = Ax, given any X whatsoever, there is one
and only one corresponding x, and it is given by x = A^{-1}X.

When A is a singular matrix, there are X for which no
corresponding x can be defined. For in such a case, r, the rank of
the matrix A, is less than n, and we can select r rows of A and
express every row as a sum of multiples of these rows (Theorem
31). Thus, the rows being suitably numbered, (1) gives relations

\[ X_k = \sum_{i=1}^{r} l_{ki} X_i \qquad (k = r+1, \dots, n), \tag{4} \]

wherein the l_ki are constants, and so the set X_1, ..., X_n is limited
to such sets as will satisfy (4): a set of X that does not satisfy
(4) will give no corresponding set of x. For example, in the
linear transformation

\[ X = 2x + 3y, \qquad Y = 4x + 6y, \]

which has the singular matrix \begin{bmatrix} 2 & 3 \\ 4 & 6 \end{bmatrix}, the pair X, Y must satisfy
the relation 2X = Y; for any pair X, Y that does not satisfy
this relation, there is no corresponding pair x, y.
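
The contrast between the two cases of § 3.2 can be seen in a short computation. The sketch below is an illustration only; the function name `solve_2x2` and the non-singular matrix are chosen here, while the singular matrix is the one of the example above.

```python
from fractions import Fraction


def solve_2x2(a, big_x):
    """Return the unique x with X = a x, or None when the modulus |a| is zero."""
    (p, q), (r, s) = a
    modulus = p * s - q * r
    if modulus == 0:
        return None
    # x = a^{-1} X, with a^{-1} = (1/|a|) [[s, -q], [-r, p]]
    return (Fraction(s * big_x[0] - q * big_x[1], modulus),
            Fraction(p * big_x[1] - r * big_x[0], modulus))


regular = [[2, 3], [1, 2]]    # modulus 1
singular = [[2, 3], [4, 6]]   # modulus 0: X = 2x + 3y, Y = 4x + 6y

print(solve_2x2(regular, (8, 5)))    # one and only one x for any X
print(solve_2x2(singular, (1, 1)))   # None; besides, 2X = Y fails for (1, 1)
```

For the singular matrix no pair (X, Y) with 2X ≠ Y is ever attained, and a pair with 2X = Y is attained by infinitely many (x, y), so in neither case is there a unique answer to return.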

3.3. The product of two transformations. Let x, y, z be
‘numbers’ (Chap. VI, § 1), each with n components, and let all
suffixes be understood to run from 1 to n. Use the summation
convention. Then the transformation

\[ x_i = a_{ij} y_j \tag{1} \]

may be written as a matrix equation x = Ay (§ 3.2), and the
transformation

\[ y_j = b_{jk} z_k \tag{2} \]

may be written as a matrix equation y = Bz.

If in (1) we substitute for y in terms of z, we obtain

\[ x_i = a_{ij} b_{jk} z_k, \tag{3} \]

which is a transformation whose matrix is the product AB
(compare Chap. VI, § 4). Thus the result of two successive
transformations x = Ay and y = Bz may be written as

\[ x = ABz. \]

Since AB is the product of the two matrices, we adopt a
similar nomenclature for the transformations themselves.

Definition 5. The transformation x = ABz is called the
PRODUCT OF THE TWO TRANSFORMATIONS x = Ay, y = Bz. [It
is the result of the successive transformations x = Ay, y = Bz.]

The modulus of the transformation (3) is the determinant
|a_ij b_jk|, that is, the product of the two determinants |A| and
|B|. Similarly, if we have three transformations in succession,

\[ x = Ay, \qquad y = Bz, \qquad z = Cu, \]

the resulting transformation is x = ABCu and the modulus of
this transformation is the product of the three moduli |A|, |B|,
and |C|; and so for any finite number of transformations.
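
A two-variable instance makes the rule for moduli concrete. In the sketch below the matrices A and B, the point z and the helper names are all chosen for illustration; the successive substitutions are compared with the single transformation x = ABz, and the modulus of the product with the product of the moduli.

```python
def matmul(a, b):
    """Matrix product of two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def apply(m, v):
    """Apply the transformation with matrix m to the column v."""
    return tuple(sum(row[k] * v[k] for k in range(len(v))) for row in m)


def det2(m):
    """Determinant of a 2 x 2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


A = [[1, 2], [3, 4]]   # x = Ay, modulus -2
B = [[0, 1], [1, 1]]   # y = Bz, modulus -1
AB = matmul(A, B)

z = (1, 1)
x_in_two_steps = apply(A, apply(B, z))
x_in_one_step = apply(AB, z)
print(x_in_two_steps, x_in_one_step, det2(AB), det2(A) * det2(B))
```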

4. Transformations of quadratic and bilinear forms 
4.1. Consider a given transformation

\[ x_i = \sum_{k=1}^{n} b_{ik} X_k \qquad (i = 1, \dots, n) \tag{1} \]

and a given quadratic form

\[ A(x, x) = \sum_{i=1}^{n} \sum_{j=1}^{n} a_{ij} x_i x_j \qquad (a_{ij} = a_{ji}). \tag{2} \]

When we substitute in (2) the values of x_1, ..., x_n given by (1),
we obtain a quadratic expression in X_1, ..., X_n. This quadratic
expression is said to be a transformation of (2) by (1), or the
result of applying (1) to the form (2).

Theorem 37. If a transformation x = BX is applied to a
quadratic form A(x, x), the result is a quadratic form C(X, X)
whose matrix C is given by

\[ C = B'AB, \]

where B' is the transpose of B.

First proof: a proof that depends entirely on matrix theory.
The form A(x, x) is given by the matrix product x'Ax (compare
§ 2.5). By hypothesis, x = BX, and this implies x' = X'B'
(Theorem 23). Hence, by the associative law of multiplication,

\[ x'Ax = X'B'ABX = X'CX, \tag{3} \]

where C denotes the matrix B'AB.

Moreover, (3) is not merely a quadratic expression in
X_1, ..., X_n, but is, in fact, a quadratic form having c_ij = c_ji.
To prove this we observe that, when C = B'AB,

\[ \begin{aligned}
C' &= B'A'B \qquad \text{(Theorem 23, Corollary)} \\
   &= B'AB \qquad \text{(since } A = A'\text{)} \\
   &= C;
\end{aligned} \]

that is to say, c_ij = c_ji.

Second proof: a proof that depends mainly on the use of the
summation convention.

Let all suffixes run from 1 to n and use the summation
convention throughout. Then the quadratic form is

\[ a_{ij} x_i x_j \qquad (a_{ij} = a_{ji}). \tag{4} \]

The transformation

\[ x_i = b_{ik} X_k \tag{5} \]

expresses (4) in the form

\[ c_{kl} X_k X_l. \tag{6} \]

We may calculate the values of the c's by carrying out the
actual substitutions for the x in terms of the X. We have

\[ a_{ij} x_i x_j = a_{ij}\, b_{ik} X_k\, b_{jl} X_l, \]

so that

\[ c_{kl} = b_{ik} a_{ij} b_{jl} = b'_{ki} a_{ij} b_{jl}, \]

where b'_ki = b_ik is the element in the kth row and ith
column of B'. Hence the matrix C of (6) is equal to the
matrix product B'AB.

Moreover, c_kl = c_lk; for, since a_ij = a_ji, we have

\[ b_{ik} a_{ij} b_{jl} = b_{ik} a_{ji} b_{jl}, \]

[…]

whose modulus, that is, the determinant |l_ij|, is equal to M; let
the resulting quadratic form be

\[ \sum_{i=1}^{n} \sum_{j=1}^{n} c_{ij} X_i X_j \qquad (c_{ij} = c_{ji}). \tag{2} \]

Then the discriminant of (2) is M² times the discriminant of (1);
in symbols

\[ |c_{ij}| = M^2 |a_{ij}|. \tag{3} \]

The content of this theorem is usefully abbreviated to the
statement ‘WHEN A QUADRATIC FORM IS CHANGED TO NEW
VARIABLES, THE DISCRIMINANT IS MULTIPLIED BY THE SQUARE
OF THE MODULUS OF THE TRANSFORMATION’.

The result is an immediate corollary of Theorem 37. By that
theorem, the matrix of the form (2) is C = L'AL, where L is
the matrix of the transformation. The discriminant of (2), that
is, the determinant |C|, is given (Chap. VI, § 10) by the product
of the three determinants |L'|, |A|, and |L|.

But |L| = M, and since the value of a determinant is
unaltered when rows and columns are interchanged, |L'| is
also equal to M. Hence

\[ |C| = M^2 |A|. \]

This theorem is of fundamental importance ; it will be used 
many times in the chapters that follow. 
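
Both Theorem 37 and the rule for the discriminant can be verified on a small case. The sketch below is an illustration under assumptions made here: the form x² + 4xy + 3y², the transformation x = X + Y, y = 2Y (modulus 2), the test point and all helper names are invented for the purpose.

```python
def matmul(a, b):
    """Matrix product of two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def det2(m):
    """Determinant of a 2 x 2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def form_value(m, v):
    """Value of the quadratic form v'mv."""
    return sum(m[i][j] * v[i] * v[j] for i in range(len(v)) for j in range(len(v)))


A = [[1, 2], [2, 3]]   # the form x^2 + 4xy + 3y^2
L = [[1, 1], [0, 2]]   # x = X + Y, y = 2Y
M = det2(L)            # modulus of the transformation

C = matmul(matmul(transpose(L), A), L)   # Theorem 37: C = L'AL

# Direct substitution at one point: the old form at x must equal the new form at X.
X = (3, -1)
x = tuple(sum(L[i][k] * X[k] for k in range(2)) for i in range(2))
print(C, form_value(A, x), form_value(C, X))
print(det2(C), M ** 2 * det2(A))
```

Here C is again symmetrical, the two values of the form agree, and the discriminant −1 of the original form has been multiplied by M² = 4.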

5. Hermitian forms 

5.1. In its most common interpretation a hermitian bilinear
form is given by

\[ \sum_{i=1}^{n} \sum_{j=1}^{n} a_{ij} x_i \bar{y}_j, \tag{1} \]

wherein the coefficients a_ij belong to the field of complex
numbers and are such that

\[ a_{ji} = \bar{a}_{ij}. \]

The bar denotes, as in the theory of the complex variable, the
conjugate complex; that is, if z = α + iβ, where α and β are real,
then z̄ = α − iβ.

The matrix A of the coefficients of the form (1) satisfies the
condition

\[ A' = \bar{A}. \tag{2} \]

Any matrix A that satisfies (2) is said to be a hermitian
matrix.

5.2. A form such as

\[ \sum_{i=1}^{n} \sum_{j=1}^{n} a_{ij} x_i \bar{x}_j \qquad (a_{ji} = \bar{a}_{ij}) \]

is said to be a hermitian form.

The theory of these forms is very similar to that of ordinary 
bilinear and quadratic forms. Theorems concerning Hermitian 
forms appear as examples at the end of this and later chapters. 


6. Cogredient and contragredient sets 

6.1. When two sets of variables x_1, ..., x_n and y_1, ..., y_n are
related to two other sets X_1, ..., X_n and Y_1, ..., Y_n by the same
transformation, say

\[ x = AX, \qquad y = AY, \]

then the two sets x and y (equally, X and Y) are said to be
cogredient sets of variables.

If a set z_1, ..., z_n is related to a set Z_1, ..., Z_n by a
transformation whose matrix is the reciprocal of the transpose of A, that is,

\[ z = (A')^{-1} Z, \quad \text{or} \quad Z = A'z, \]

then the sets x and z (equally, X and Z) are said to be
contragredient sets of variables.

Examples of cogredient sets readily occur. A transformation 
in plane analytical geometry from one triangle of reference to 
another is of the type 


\[ \begin{aligned}
x &= l_1 X + m_1 Y + n_1 Z, \\
y &= l_2 X + m_2 Y + n_2 Z, \\
z &= l_3 X + m_3 Y + n_3 Z.
\end{aligned} \tag{1} \]

The sets of variables (x_1, y_1, z_1) and (x_2, y_2, z_2), regarded as the
coordinates of two distinct points, are cogredient sets: the
coordinates of the two points referred to the new triangle
of reference are (X_1, Y_1, Z_1) and (X_2, Y_2, Z_2), and each is
obtained by putting in the appropriate suffix in the
equations (1).

Analytical geometry also furnishes an important example of
contragredient sets. Let

\[ lx + my + nz = 0 \tag{2} \]

be regarded as the equation of a given line a in a system of
homogeneous point-coordinates (x, y, z) with respect to a given
triangle of reference. Then the line (tangential) coordinates of
a are (l, m, n). Take a new triangle of reference and suppose
that (1) is the transformation for the point-coordinates. Then
(2) becomes

\[ LX + MY + NZ = 0, \]

where

\[ \begin{aligned}
L &= l_1 l + l_2 m + l_3 n, \\
M &= m_1 l + m_2 m + m_3 n, \\
N &= n_1 l + n_2 m + n_3 n.
\end{aligned} \tag{3} \]

The matrix of the coefficients of l, m, n in (3) is the transpose
of the matrix of the coefficients of X, Y, Z in (1). Hence point-
coordinates (x, y, z) and line-coordinates (l, m, n) are
contragredient variables when the triangle of reference is changed.
[Notice that (1) is X, Y, Z → x, y, z whilst (3) is l, m, n →
L, M, N.]

The notation of the foregoing becomes more compact when
we consider n dimensions, a point x with coordinates (x_1, ..., x_n),
and a ‘flat’ l with coordinates (l_1, ..., l_n). The transformation
x = AX of the point-coordinates entails the transformation
L = A'l of the tangential coordinates.
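
The reason for the transposed matrix is that the expression lx + my + nz must keep its value when both sets are changed. The following sketch checks this on one numerical case; the matrix A, the point X, the line-coordinates and the helper names are all assumptions made for the illustration.

```python
def apply(m, v):
    """Apply the transformation with matrix m to the column v."""
    return tuple(sum(row[k] * v[k] for k in range(len(v))) for row in m)


def transpose(m):
    return [list(col) for col in zip(*m)]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


A = [[1, 2, 0], [0, 1, 1], [1, 0, 1]]   # point-coordinates: x = AX
X = (1, 2, 3)
old_line = (2, -1, 3)                   # (l, m, n)

x = apply(A, X)                         # old point-coordinates
new_line = apply(transpose(A), old_line)  # (L, M, N) = A'(l, m, n)

print(dot(old_line, x), dot(new_line, X))
```

The two printed values are equal because l'x = l'(AX) = (A'l)'X; in particular a point lies on the line in the old coordinates exactly when it does so in the new.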


6.2. Another example of contragredient sets, one that con- 
tains the previous example as a particular case, is provided by 
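differential operators. Let F(x_1, ..., x_n) be expressed as a
function of X_1, ..., X_n by means of the transformation x = AX,
where A denotes the matrix [a_ij]. Then

\[ \frac{\partial F}{\partial X_i} = \sum_{j=1}^{n} \frac{\partial F}{\partial x_j} \frac{\partial x_j}{\partial X_i}
   = \sum_{j=1}^{n} a_{ji} \frac{\partial F}{\partial x_j}. \]

That is to say, in matrix form, if x = AX,

\[ \frac{\partial}{\partial X} = A' \frac{\partial}{\partial x}. \]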

Accordingly, the x_i and the ∂/∂x_i form contragredient sets.

7. The characteristic equation of a transformation
7.1. Consider the transformation X = Ax or, in full,

\[ X_i = \sum_{j=1}^{n} a_{ij} x_j \qquad (i = 1, \dots, n). \tag{1} \]

Is it possible to assign values to the variables x so that
X_i = λx_i (i = 1, ..., n), where λ is independent of i? If such
a result is to hold, we must have

\[ \lambda x_i = \sum_{j=1}^{n} a_{ij} x_j \qquad (i = 1, \dots, n), \tag{2} \]

and this demands (Theorem 11), when one of x_1, ..., x_n differs
from zero, that λ be a root of the equation

\[ \Delta(\lambda) \equiv \begin{vmatrix} a_{11}-\lambda & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22}-\lambda & \cdots & a_{2n} \\ \vdots & \vdots & & \vdots \\ a_{n1} & a_{n2} & \cdots & a_{nn}-\lambda \end{vmatrix} = 0. \]

This equation is called the characteristic equation of the 
transformation (1) ; any root of the equation is called a charac- 
teristic number or a latent root of the transformation. 

If λ is a characteristic number, there is a set of numbers
x_1, ..., x_n, not all zero (Theorem 11), that satisfy equations (2).
Let λ = λ_1, a characteristic number. If the determinant Δ(λ_1)
is of rank (n − 1), there is a unique corresponding set of ratios,
say

\[ x_1^{(1)} : x_2^{(1)} : \dots : x_n^{(1)}, \]

that satisfies (2). If the rank of the determinant Δ(λ_1) is n − 2
or less, the set of ratios is not unique (Chap. VIII, § 5).

A set of ratios that satisfies (2) when λ is equal to a
characteristic number λ_r, is called a pole corresponding to λ_r.
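
For a transformation in two variables the poles can be found directly. The sketch below is an illustration only: the matrix is chosen here so that its characteristic numbers, 1 and 3, are integers, the function names are invented, and `pole` assumes that the number passed to it really is a characteristic number.

```python
def char_det(a, lam):
    """The determinant Delta(lambda) of a 2 x 2 transformation."""
    return (a[0][0] - lam) * (a[1][1] - lam) - a[0][1] * a[1][0]


def pole(a, lam):
    """Ratios x_1 : x_2 satisfying equations (2) for the characteristic number lam.

    Returns None when Delta(lam) is of rank 0, in which case the ratios
    are not unique and every point is a pole.
    """
    rows = [(a[0][0] - lam, a[0][1]), (a[1][0], a[1][1] - lam)]
    for p, q in rows:
        if (p, q) != (0, 0):
            return (-q, p)   # solves p*x_1 + q*x_2 = 0
    return None


def apply(m, v):
    return tuple(sum(row[k] * v[k] for k in range(len(v))) for row in m)


A = [[2, 1], [1, 2]]
for lam in (1, 3):
    x = pole(A, lam)
    print(lam, char_det(A, lam), x, apply(A, x))
```

For each characteristic number the transformed set `apply(A, x)` is λ times x, so the ratios x_1 : x_2 are unchanged by the transformation.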

7.2. If (x_1, x_2, x_3) and (X_1, X_2, X_3) are homogeneous
coordinates referred to a given triangle of reference in a plane,
then (1), with |A| ≠ 0, may be regarded as a method of
generating a one-to-one correspondence between the variable
point (x_1, x_2, x_3) and the variable point (X_1, X_2, X_3). A pole
of the transformation is then a point which corresponds to
itself.

We shall not elaborate the geometrical implications† of such
transformations; a few examples are given on the next page.

Examples X 

1. Prove that all numbers of the form a + b√3, where a and b are
integers (or zero), constitute a ring; and that all numbers of the form
a + b√3, where a and b are the ratios of integers (or zero), constitute a
field.

2. Express ax² + 2hxy + by² as a quadratic form (in accordance with
the definition of § 2.1) and show that its discriminant is ab − h² (§ 2.6).

3. Write down the transformation which is the product of the two
transformations

\[ \left. \begin{aligned}
x &= l_1 \xi + m_1 \eta + n_1 \zeta \\
y &= l_2 \xi + m_2 \eta + n_2 \zeta \\
z &= l_3 \xi + m_3 \eta + n_3 \zeta
\end{aligned} \right\}, \qquad
\left. \begin{aligned}
\xi &= \lambda_1 X + \mu_1 Y + \nu_1 Z \\
\eta &= \lambda_2 X + \mu_2 Y + \nu_2 Z \\
\zeta &= \lambda_3 X + \mu_3 Y + \nu_3 Z
\end{aligned} \right\}. \]

4. Prove that, in solid geometry, if i_1, j_1, k_1 and i_2, j_2, k_2 are two unit
frames of reference for cartesian coordinates and if, for r = 1, 2,

\[ i_r \wedge j_r = k_r, \qquad j_r \wedge k_r = i_r, \qquad k_r \wedge i_r = j_r, \]

then the transformation of coordinates has unit modulus.

[Omit if the vector notation is not known.]

5. Verify Theorem 37, when the original quadratic form is

\[ ax^2 + 2hxy + by^2 \]

and the transformation is x = l_1 X + m_1 Y, y = l_2 X + m_2 Y, by actual
substitution on the one hand and the evaluation of the matrix product
(B'AB of the theorem) on the other.

6. Verify Theorem 38, by the method of Example 5, when the
bilinear form is

\[ a_{11} x_1 y_1 + a_{12} x_1 y_2 + a_{21} x_2 y_1 + a_{22} x_2 y_2. \]

7. The homogeneous coordinates of the points D, E, F referred to
ABC as triangle of reference are (x_1, y_1, z_1), (x_2, y_2, z_2), and (x_3, y_3, z_3).
Prove that, if (l, m, n) are line-coordinates referred to ABC and
(L, M, N) are line-coordinates referred to DEF, then

\[ \Delta l = X_1 L + X_2 M + X_3 N, \quad \text{etc.,} \]

where Δ is the determinant |x_1 y_2 z_3| and X_r, ... are the co-factors of
x_r, ... in Δ.

Hint. First obtain the transformation x, y, z → X, Y, Z and then
use § 6.1.

† Cf. R. M. Winger, An Introduction to Projective Geometry (New York,
1922), or, for homogeneous coordinates in three dimensions, G. Darboux,
Principes de géométrie analytique (Gauthier-Villars, Paris, 1917).

8. In the transformation X = Ax, new variables Y and y, defined by

\[ y = Bx, \qquad Y = BX \qquad (|B| \neq 0), \]

are introduced. Prove that the transformation Y → y is given by
Y = Cy, where C = BAB^{-1}.

9. The transformations x = BX, y = BY change the Hermitian†
a_ij x_i ȳ_j (with A' = Ā) into c_ij X_i Ȳ_j. Prove that

\[ C = B'A\bar{B}, \qquad C' = \bar{B}'A'B = \bar{C}. \]

10. Prove that the transformation x = BX, together with its
conjugate complex x̄ = B̄X̄, changes a_ij x_i x̄_j (A' = Ā) into c_ij X_i X̄_j (C' = C̄),
where C = B'AB̄.

11. The elements of the matrices A and B belong to a given field 𝔉.
Prove that, if the quadratic form A(x, x) is transformed by the non-
singular transformation x = BX (or by X = Bx) into C(X, X), then
every coefficient of C belongs to 𝔉.

12. Prove that, if each x_r denotes a complex variable and a_rs are
complex numbers such that a_rs = ā_sr (r = 1, ..., n; s = 1, ..., n), the Hermitian
form a_rs x_r x̄_s is a real number.

[Examples 13-16 are geometrical in character. ] 

13. Prove that, if the points in a plane are transformed by the
scheme

\[ x' = a_1 x + b_1 y + c_1 z, \quad y' = a_2 x + b_2 y + c_2 z, \quad z' = a_3 x + b_3 y + c_3 z, \]

then every straight line transforms into a straight line. Prove also that
there are in general three points that are transformed into themselves
and three lines that are transformed into themselves.

Find the conditions to be satisfied by the coefficients in order that
every point on a given line may be transformed into itself.

14. Show that the transformation of Example 13 can, in general, by a
change of triangle of reference be reduced to the form

\[ X' = \alpha X, \quad Y' = \beta Y, \quad Z' = \gamma Z. \]

Hence, or otherwise, show that the transformation is a homology (plane
perspective, collineation) if a value of $\lambda$ can be found which will make

\[ \begin{vmatrix} a_1 - \lambda & b_1 & c_1 \\ a_2 & b_2 - \lambda & c_2 \\ a_3 & b_3 & c_3 - \lambda \end{vmatrix} \]

of rank one.

Hint. When the determinant is of rank one there is a line l such that
every point of it corresponds to itself. In such a case any two
corresponding lines must intersect on l.

† The summation convention is employed in Examples 9, 10, and 12.

15. Obtain the equations giving, in two dimensions, the homology
(collineation, plane perspective) in which the point $(x_1', y_1', z_1')$ is the
centre and the line $(l_1', l_2', l_3')$ the axis of the homology.

16. Points in a plane are transformed by the scheme

\[ x' = \rho x + \alpha(\lambda x + \mu y + \nu z), \]
\[ y' = \rho y + \beta(\lambda x + \mu y + \nu z), \]
\[ z' = \rho z + \gamma(\lambda x + \mu y + \nu z). \]

Find the points and lines that are transformed into themselves.

Show also that the transformation is involutory (i.e. two applications
of it restore a figure to its original position) if and only if

\[ 2\rho + \alpha\lambda + \beta\mu + \gamma\nu = 0. \]

[Remember that $(kx', ky', kz')$ is the same point as $(x', y', z')$.]



CHAPTER XI 


THE POSITIVE-DEFINITE FORM 

1. Definite real forms

1.1. Definition 6. The real quadratic form

\[ \sum_{i=1}^{n} \sum_{j=1}^{n} a_{ij} x_i x_j \qquad (a_{ij} = a_{ji}) \]

is said to be positive-definite if it is positive for every set of
real values of $x_1, \ldots, x_n$ other than the set $x_1 = x_2 = \ldots = x_n = 0$.
It is said to be negative-definite if it is negative for every set
of real values of $x_1, \ldots, x_n$ other than $x_1 = x_2 = \ldots = x_n = 0$.

For example, $3x_1^2 + 2x_2^2$ is positive-definite, while $3x_1^2 - 2x_2^2$ is
not positive-definite; the first is positive for every pair of real
values of $x_1$ and $x_2$ except the pair $x_1 = 0$, $x_2 = 0$; the second
is positive when $x_1 = x_2 = 1$, but it is negative when $x_1 = 1$
and $x_2 = 2$, and is zero when $x_1 = \sqrt{2}$ and $x_2 = \sqrt{3}$.

An example of a real form that can never be negative but 
is not positive-definite, according to the definition, is given by 

{3x x -2x 2 Y+tx\. 

This quadratic form is zero when $x_1 = 2$, $x_2 = 3$, and $x_3 = 0$,
and so it does not come within the scope of Definition 6. The
point of excluding such a form from the definition is that,
whereas it appears as a function of three variables, $x_1, x_2, x_3$,
it is essentially a function of two variables, namely,

\[ X_1 = 3x_1 - 2x_2, \qquad X_2 = x_3. \]

A positive-definite form in one set of variables is still a 
positive-definite form when expressed in a new set of variables, 
provided only that the two sets are connected by a real non- 
singular transformation. This we now prove. 

Theorem 40. A real positive-definite form in the n variables
$x_1, \ldots, x_n$ is a positive-definite form in the n variables $X_1, \ldots, X_n$
provided that the two sets of variables are connected by a real,
non-singular transformation.

Let the positive-definite form be B(x, x), and let the real
non-singular transformation be X = Ax. Then $x = A^{-1}X$. Let
B(x, x) become C(X, X) when expressed in terms of X.

Since B(x, x) is positive-definite, C(X, X) is positive for every
X save that which corresponds to x = 0. But X = Ax, where
A is a non-singular matrix, and so X = 0 if and only if x = 0.
Hence C(X, X) is positive for every X other than X = 0.

Note. The equation x = 0 is a matrix equation, x being a
single-column matrix whose elements are $x_1, \ldots, x_n$.
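
The statement of Theorem 40 may be tried on a small numerical case. The following Python sketch is illustrative only: the matrices B and P are arbitrary assumed data (P stands for $A^{-1}$, so that x = PX), and the helper names are hypothetical. It forms the matrix C = P'BP of the transformed form and confirms that C(X, X) is positive at every non-zero integer point of a small grid. Such a check illustrates the theorem; it does not, of course, prove it.

```python
from itertools import product

def transpose(M):
    return [list(col) for col in zip(*M)]

def matmul(P, Q):
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

def form_value(M, v):
    """Value of the quadratic form v'Mv."""
    n = len(v)
    return sum(M[i][j] * v[i] * v[j] for i in range(n) for j in range(n))

# Assumed example: B(x, x) = 2*x1^2 + 2*x1*x2 + 3*x2^2, a positive-definite form.
B = [[2, 1], [1, 3]]
# P plays the part of the inverse of A: x = PX, with determinant 1.
P = [[1, 2], [0, 1]]

# Matrix of the form when expressed in the new variables X.
C = matmul(transpose(P), matmul(B, P))

grid = [X for X in product(range(-3, 4), repeat=2) if X != (0, 0)]
all_positive = all(form_value(C, X) > 0 for X in grid)
print(C, all_positive)
```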

1.2. The most obvious type of positive-definite form in n
variables is

\[ a_{11} x_1^2 + \ldots + a_{nn} x_n^2 \qquad (a_{rr} > 0). \]

We now show that every positive-definite form in n variables
is a transformation of this obvious type.

Theorem 41. Every real positive-definite form, B(x, x), can be
transformed by a real transformation of unit modulus into a form

\[ c_{11} X_1^2 + \ldots + c_{nn} X_n^2 \]

wherein each $c_{rr}$ is positive.


The manipulations that follow are typical of others that occur 
in later work. Here we give them in detail; later we shall refer 
back to this section and omit as many of the details as clarity 
permits. 

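Let the given positive-definite form be

\[ \begin{aligned} B(x,x) &= \sum_{i=1}^{n} \sum_{j=1}^{n} b_{ij} x_i x_j \qquad (b_{ij} = b_{ji}) \\ &= \sum_{r=1}^{n} b_{rr} x_r^2 + 2 \sum_{r<s} b_{rs} x_r x_s. \end{aligned} \tag{1} \]

The terms of (1) that involve $x_1$ are

\[ b_{11} x_1^2 + 2b_{12} x_1 x_2 + \ldots + 2b_{1n} x_1 x_n. \tag{2} \]

Moreover, since (1) is positive-definite, it is positive when $x_1 = 1$
and $x_2 = \ldots = x_n = 0$; hence $b_{11} > 0$. Accordingly, the terms
(2) may be written as†

\[ b_{11}\Bigl(x_1 + \frac{b_{12}}{b_{11}} x_2 + \ldots + \frac{b_{1n}}{b_{11}} x_n\Bigr)^2 - \frac{1}{b_{11}}\,(b_{12} x_2 + \ldots + b_{1n} x_n)^2, \]

and so, if

\[ X_1 = x_1 + \frac{b_{12}}{b_{11}} x_2 + \ldots + \frac{b_{1n}}{b_{11}} x_n, \qquad X_r = x_r \quad (r = 2, \ldots, n), \tag{3} \]

we have

\[ \begin{aligned} B(x,x) &= b_{11} X_1^2 + \text{a quadratic form in } X_2, \ldots, X_n \\ &= b_{11} X_1^2 + \sum_{r=2}^{n} \sum_{s=2}^{n} \beta_{rs} X_r X_s, \end{aligned} \tag{4} \]

say.

† It is essential to this step that $b_{11}$ is not zero.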

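The transformation (3) is non-singular, the determinant of
the transformation being

\[ \begin{vmatrix} 1 & b_{12}/b_{11} & \cdots & b_{1n}/b_{11} \\ 0 & 1 & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & 1 \end{vmatrix} = 1. \]

Hence (Theorem 40) the form (4) in X is positive-definite and
$\beta_{22} > 0$ (put $X_2 = 1$, $X_r = 0$ when $r \neq 2$).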

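Working as before, we have

\[ B(x,x) = b_{11} X_1^2 + \beta_{22}\Bigl(X_2 + \frac{\beta_{23}}{\beta_{22}} X_3 + \ldots + \frac{\beta_{2n}}{\beta_{22}} X_n\Bigr)^2 + \text{a quadratic form in } X_3, \ldots, X_n. \]

Let this quadratic form in $X_3, \ldots, X_n$ be $\sum\sum \gamma_{rs} X_r X_s$; then, on
writing

\[ Y_1 = X_1, \qquad Y_2 = X_2 + \frac{\beta_{23}}{\beta_{22}} X_3 + \ldots + \frac{\beta_{2n}}{\beta_{22}} X_n, \qquad Y_r = X_r \quad (r = 3, \ldots, n), \tag{5} \]

we have

\[ B(x,x) = b_{11} Y_1^2 + \beta_{22} Y_2^2 + \sum_{r=3}^{n} \sum_{s=3}^{n} \gamma_{rs} Y_r Y_s. \tag{6} \]

The transformation (5) is of unit modulus and so, by Theorem
40 applied to the form (4), form (6) is positive-definite and
$\gamma_{33} > 0$ (put $Y_3 = 1$, $Y_r = 0$ when $r \neq 3$).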

Working as before, we obtain as our next form

\[ b_{11} Z_1^2 + \beta_{22} Z_2^2 + \gamma_{33} Z_3^2 + \text{a quadratic form in } Z_4, \ldots, Z_n, \tag{7} \]

wherein $b_{11}$, $\beta_{22}$, $\gamma_{33}$ are positive. As a preparation for the next
step, the coefficient of $Z_4^2$ can be shown to be positive by proving
that (7) is a positive-definite form. Moreover, we have

\[ \begin{aligned} Z_1 &= Y_1 = X_1 = x_1 + \frac{b_{12}}{b_{11}} x_2 + \ldots + \frac{b_{1n}}{b_{11}} x_n, \\ Z_2 &= Y_2 = x_2 + \frac{\beta_{23}}{\beta_{22}} x_3 + \ldots + \frac{\beta_{2n}}{\beta_{22}} x_n, \\ Z_3 &= x_3 + \frac{\gamma_{34}}{\gamma_{33}} x_4 + \ldots + \frac{\gamma_{3n}}{\gamma_{33}} x_n, \\ Z_r &= Y_r = X_r = x_r \quad (r > 3). \end{aligned} \]

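Proceeding step by step in this way, we finally obtain

\[ B(x,x) = b_{11} \xi_1^2 + \beta_{22} \xi_2^2 + \gamma_{33} \xi_3^2 + \ldots + \kappa_{nn} \xi_n^2 \tag{8} \]

wherein $b_{11}, \ldots, \kappa_{nn}$ are positive, and

\[ \begin{aligned} \xi_1 &= x_1 + \frac{b_{12}}{b_{11}} x_2 + \ldots + \frac{b_{1n}}{b_{11}} x_n, \\ \xi_2 &= x_2 + \ldots + \frac{\beta_{2n}}{\beta_{22}} x_n, \\ &\;\;\vdots \\ \xi_n &= x_n. \end{aligned} \tag{9} \]

We have thus transformed B(x, x) into (8), which is of the
form required by the theorem; moreover, the transformation
$x \to \xi$ is (9), which is of unit modulus.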
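
The step-by-step completion of squares in the proof above lends itself to mechanical computation. The following Python sketch is offered as an illustration under stated assumptions, not as part of the original text: the function name `reduce_to_squares` and the sample matrix are hypothetical, and exact rational arithmetic (`fractions.Fraction`) is used so that no rounding intervenes. At each stage it records the leading coefficient ($b_{11}$, then $\beta_{22}$, and so on), stores the corresponding row of the transformation (9), and subtracts the completed square from what remains.

```python
from fractions import Fraction

def reduce_to_squares(B):
    """Return (coeffs, T) with B(x, x) = sum_r coeffs[r] * xi_r**2, where xi = T x.

    T is upper triangular with 1's on the diagonal, so its determinant is 1.
    A ValueError is raised if a non-positive leading coefficient is met,
    which cannot happen when the form is positive-definite.
    """
    n = len(B)
    M = [[Fraction(v) for v in row] for row in B]   # working copy of the matrix
    T = [[Fraction(0)] * n for _ in range(n)]
    coeffs = []
    for r in range(n):
        pivot = M[r][r]
        if pivot <= 0:
            raise ValueError("the form is not positive-definite")
        coeffs.append(pivot)
        T[r][r] = Fraction(1)
        for s in range(r + 1, n):
            T[r][s] = M[r][s] / pivot
        # Subtract the completed square; the remainder involves later variables only.
        for i in range(r + 1, n):
            for j in range(r + 1, n):
                M[i][j] -= M[r][i] * M[r][j] / pivot
    return coeffs, T

# Assumed example: 2*x1^2 + 3*x2^2 + 2*x3^2 + 2*x1*x2 + 2*x2*x3.
B = [[2, 1, 0], [1, 3, 1], [0, 1, 2]]
coeffs, T = reduce_to_squares(B)
print(coeffs)
```

For this sample form the product of the coefficients equals the discriminant of the form, as Theorem 39 requires of a transformation of unit modulus.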


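2. Necessary and sufficient conditions for a positive-definite form

2.1. Theorem 42. A set of necessary and sufficient conditions
that the real quadratic form

\[ \sum_{i=1}^{n} \sum_{j=1}^{n} a_{ij} x_i x_j \qquad (a_{ij} = a_{ji}) \tag{1} \]

be positive-definite is

\[ a_{11} > 0, \quad \begin{vmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{vmatrix} > 0, \quad \ldots, \quad \begin{vmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{n1} & \cdots & a_{nn} \end{vmatrix} > 0. \]

If the form is positive-definite, then, as in § 1.2, there is a
transformation

\[ \begin{aligned} X_1 &= x_1 + \frac{a_{12}}{a_{11}} x_2 + \ldots + \frac{a_{1n}}{a_{11}} x_n, \\ X_2 &= x_2 + \ldots + \frac{\beta_{2n}}{\beta_{22}} x_n, \\ &\;\;\vdots \\ X_n &= x_n, \end{aligned} \]

of unit modulus, whereby the form is transformed into

\[ a_{11} X_1^2 + \beta_{22} X_2^2 + \ldots + \kappa_{nn} X_n^2 \tag{2} \]

and in this form $a_{11}, \beta_{22}, \ldots, \kappa_{nn}$ are all positive. The discriminant
of the form (2) is $a_{11}\beta_{22} \ldots \kappa_{nn}$. Hence, by Theorem 39,

\[ \begin{vmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{n1} & \cdots & a_{nn} \end{vmatrix} = a_{11}\beta_{22} \ldots \kappa_{nn}, \]

and so is positive.

Now consider (1) when $x_n = 0$. By the previous argument
applied to a form in the n-1 variables $x_1, \ldots, x_{n-1}$,

\[ \begin{vmatrix} a_{11} & \cdots & a_{1,n-1} \\ \vdots & & \vdots \\ a_{n-1,1} & \cdots & a_{n-1,n-1} \end{vmatrix} = a_{11}\beta_{22} \ldots \text{ to } n-1 \text{ terms}, \]

and so is positive.

Similarly, on putting $x_n = 0$ and $x_{n-1} = 0$,

\[ \begin{vmatrix} a_{11} & \cdots & a_{1,n-2} \\ \vdots & & \vdots \\ a_{n-2,1} & \cdots & a_{n-2,n-2} \end{vmatrix} = a_{11}\beta_{22} \ldots \text{ to } n-2 \text{ terms}, \]

and so on. Hence the given set of conditions is necessary.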

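Conversely, if the set of conditions holds, then, in the first
place, $a_{11} > 0$ and we may write, as in § 1.2,

\[ \begin{aligned} A(x,x) &= a_{11}\Bigl(x_1 + \frac{a_{12}}{a_{11}} x_2 + \ldots + \frac{a_{1n}}{a_{11}} x_n\Bigr)^2 + \text{a quadratic form in } x_2, \ldots, x_n \\ &= a_{11} X_1^2 + \beta_{22} X_2^2 + \ldots + \beta_{nn} X_n^2 + 2\beta_{23} X_2 X_3 + \ldots, \end{aligned} \tag{3} \]

say, where

\[ X_1 = x_1 + \frac{a_{12}}{a_{11}} x_2 + \ldots + \frac{a_{1n}}{a_{11}} x_n, \qquad X_k = x_k \quad (k > 1). \]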

Before we can proceed we must prove that $\beta_{22}$ is positive.
Consider the form and transformation when $x_k = 0$ for k > 2.
The discriminant of the form (3) is then $a_{11}\beta_{22}$, the modulus
of the transformation is unity, and the discriminant of

\[ a_{11} x_1^2 + 2a_{12} x_1 x_2 + a_{22} x_2^2 \]

is $a_{11}a_{22} - a_{12}^2$. Hence, by Theorem 39 applied to a form in the
two variables $x_1$ and $x_2$ only,

\[ a_{11}\beta_{22} = a_{11}a_{22} - a_{12}^2, \]

and so is positive by hypothesis. Hence $\beta_{22}$ is positive, and we
may write (3) as

\[ a_{11} Y_1^2 + \beta_{22} Y_2^2 + \text{a quadratic form in } Y_3, \ldots, Y_n, \]

where

\[ Y_1 = X_1, \qquad Y_2 = X_2 + \frac{\beta_{23}}{\beta_{22}} X_3 + \ldots + \frac{\beta_{2n}}{\beta_{22}} X_n, \qquad Y_k = X_k \quad (k > 2). \]

That is, we may write

\[ A(x,x) = a_{11} Y_1^2 + \beta_{22} Y_2^2 + \gamma_{33} Y_3^2 + \ldots + 2\gamma_{34} Y_3 Y_4 + \ldots, \tag{4} \]

where $a_{11}$ and $\beta_{22}$ are positive.

Consider the forms (3), (4) and the transformation from X to
Y when $x_k = 0$ for k > 3. The discriminant of (4) is $a_{11}\beta_{22}\gamma_{33}$,
the modulus of the transformation is unity, and the discriminant
of (3) is

\[ \begin{vmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{vmatrix}, \tag{5} \]

which is positive by hypothesis. Hence, by Theorem 39 applied
to a form in the three variables $x_1, x_2, x_3$ only, the product
$a_{11}\beta_{22}\gamma_{33}$ is equal to the determinant (5), and so is positive.
Accordingly, $\gamma_{33}$ is positive.

We may proceed thus, step by step, to the result that, when
the set of conditions is satisfied, we may write A(x, x) in the
form

\[ a_{11} X_1^2 + \beta_{22} X_2^2 + \ldots + \kappa_{nn} X_n^2 \tag{6} \]

wherein $a_{11}, \beta_{22}, \ldots, \kappa_{nn}$ are positive and

\[ \begin{aligned} X_1 &= x_1 + \frac{a_{12}}{a_{11}} x_2 + \ldots + \frac{a_{1n}}{a_{11}} x_n, \\ X_2 &= x_2 + \ldots + \frac{\beta_{2n}}{\beta_{22}} x_n, \\ &\;\;\vdots \\ X_n &= x_n. \end{aligned} \tag{7} \]

The form (6) is positive-definite in $X_1, \ldots, X_n$ and, since the
transformation (7) is non-singular, A(x, x) is positive-definite in
$x_1, \ldots, x_n$ (Theorem 40).
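
The conditions of Theorem 42 (often called Sylvester's criterion) are easily tested by machine. The Python sketch below is an assumption-laden illustration rather than anything taken from the text: the function names and the sample matrices are hypothetical, and determinants are evaluated exactly by elimination over the rationals. It computes the n leading determinants of the theorem and reports whether all are positive.

```python
from fractions import Fraction

def det(M):
    """Determinant by exact Gaussian elimination over the rationals."""
    M = [[Fraction(v) for v in row] for row in M]
    n = len(M)
    d = Fraction(1)
    for c in range(n):
        # Look for a non-zero entry in column c, at or below the diagonal.
        p = next((r for r in range(c, n) if M[r][c] != 0), None)
        if p is None:
            return Fraction(0)
        if p != c:
            M[c], M[p] = M[p], M[c]
            d = -d                      # a row interchange changes the sign
        d *= M[c][c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n):
                M[r][k] -= f * M[c][k]
    return d

def leading_minors(A):
    """The determinants |a11|, |a11 a12; a21 a22|, ..., |A| of Theorem 42."""
    return [det([row[:k] for row in A[:k]]) for k in range(1, len(A) + 1)]

def is_positive_definite(A):
    return all(m > 0 for m in leading_minors(A))

# Assumed examples.
A1 = [[2, 1, 0], [1, 3, 1], [0, 1, 2]]
A2 = [[1, 2], [2, 1]]
print(leading_minors(A1), is_positive_definite(A1))
print(leading_minors(A2), is_positive_definite(A2))
```

The second sample, $x_1^2 + 4x_1x_2 + x_2^2$, fails at the second determinant; it is in fact negative when $x_1 = 1$, $x_2 = -1$.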

2.2. We have considered the variables in the order $x_1, x_2, \ldots, x_n$
and we have begun with $a_{11}$. We might have considered the
variables in the order $x_n, x_{n-1}, \ldots, x_1$ and begun with $a_{nn}$. We
should then have obtained a different set of necessary and
sufficient conditions, namely,

\[ a_{nn} > 0, \quad \begin{vmatrix} a_{nn} & a_{n,n-1} \\ a_{n-1,n} & a_{n-1,n-1} \end{vmatrix} > 0, \quad \ldots. \]

Equally, any permutation of the order of the variables will 
give rise to a set of necessary and sufficient conditions. 

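2.3. The form A(x, x) is negative-definite if the form
{-A(x, x)} is positive-definite. Accordingly, the form A(x, x)
is negative-definite if and only if

\[ a_{11} < 0, \quad \begin{vmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{vmatrix} > 0, \quad \begin{vmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{vmatrix} < 0, \quad \ldots. \]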


Examples XI 

1. Prove that each of the quadratic forms

(i) $6x^2 + 35y^2 + 11z^2 + 34yz$,

(ii) $6x^2 + 49y^2 + 51z^2 - 82yz + 20zx - 4xy$
is positive-definite, but that

(Ui) 4^+92/®+ 2z 2 + Syz + 6zx + Qxy 
2. Prove that if

\[ F \equiv \sum_{r=1}^{3} \sum_{s=1}^{3} a_{rs} x_r x_s \qquad (a_{rs} = a_{sr}) \]

is a positive-definite form, then

\[ a_{11} F = (a_{11} x_1 + a_{12} x_2 + a_{13} x_3)^2 + K(x_2, x_3), \]

where K is a positive-definite form in $x_2, x_3$ whose discriminant is $a_{11}$
times the discriminant of F.

Hint. Use the transformation $X_1 = a_{11} x_1 + a_{12} x_2 + a_{13} x_3$, $X_2 = x_2$,
$X_3 = x_3$ and Theorem 39.

3. Prove that the discriminant of the quadratic form in x, y, z

\[ F \equiv (a_1 x + b_1 y + c_1 z)^2 + (a_2 x + b_2 y + c_2 z)^2 \]

is the square (by rows) of a determinant whose columns are
$a_1, b_1, c_1$; $a_2, b_2, c_2$; 0, 0, 0.

4. Extension of Example 3. Prove that a sum of squares of r distinct
linear forms in n variables is a quadratic form in n variables of zero
discriminant whenever r < n.

5. Harder. Prove also that the rank of such a discriminant is, in
general, equal to r; and that the exception to the general rule arises
when the r distinct forms are not linearly independent.

6. Prove that the discriminant of

\[ \sum_{r=1}^{n} (a_{r1} x_1 + \ldots + a_{rn} x_n)^2 \]

is not zero unless the forms

\[ \sum_{s=1}^{n} a_{rs} x_s \qquad (r = 1, \ldots, n) \]

are linearly dependent.

Hint. Compare Examples 3 and 4.

7. Prove that the discriminant of the quadratic form

\[ \sum_{r \neq s} (x_r - x_s)^2 \qquad (r, s = 1, \ldots, n) \]

is of rank n - 1.

8. If $f(x_1, \ldots, x_n)$ is a function of n variables, $f_i$ denotes $\partial f/\partial x_i$ and $f_{ij}$
denotes $\partial^2 f/\partial x_i \partial x_j$ evaluated at $x_r = \alpha_r$ (r = 1,..., n), prove that f has a
minimum at $x_r = \alpha_r$ provided that $\sum\sum f_{ij}\,\xi_i \xi_j$ is a positive-definite
form and each $f_i$ is zero. Write down conditions that f(x, y) may be a
minimum at the point $x = \alpha$, $y = \beta$.

9. By Example 12, p. 133, a Hermitian form has a real value for every
set of values of the variables. It is said to be positive-definite when
this value is positive for every set of values of the variables other than
$x_1 = \ldots = x_n = 0$. Prove Theorem 40 for a positive-definite Hermitian
form and any non-singular transformation.

10. Prove that a positive-definite Hermitian form can be transformed
by a transformation of unit modulus into a form

\[ c_{11} X_1 \bar{X}_1 + \ldots + c_{nn} X_n \bar{X}_n \]

wherein each $c_{rr}$ is positive.

Hint. Compare the proof of Theorem 41.

11. The determinant $|A| \equiv |a_{ij}|$ is called the discriminant of the
Hermitian form $\sum\sum a_{ij} x_i \bar{x}_j$. Remembering that a Hermitian form is
characterized by the matrix equation $\bar{A}' = A$, it follows, by considering
the determinant equation

\[ |A| = |\bar{A}'| = \overline{|A|}, \]

that the discriminant is real.

Prove the analogue of Theorem 42 for Hermitian forms.

12. Prove, by analogy with Example 3, that the discriminant of the
Hermitian form

\[ X\bar{X} + Y\bar{Y}, \]

where $X = a_1 x + b_1 y + c_1 z$, $Y = a_2 x + b_2 y + c_2 z$,
is zero.

Obtain the corresponding analogues of Examples 4, 5, and 6.



CHAPTER XII 


THE CHARACTERISTIC EQUATION AND 
CANONICAL FORMS 


1. The $\lambda$ equation of two quadratic forms

1.1. We have seen that the discriminant of a quadratic form
A(x, x) is multiplied by $M^2$ when the variables x are changed
to variables X by a transformation of modulus M (Theorem 39).

If A(x, x) and C(x, x) are any two distinct quadratic forms
in n variables and $\lambda$ is an arbitrary parameter, the discriminant
of the form $A(x,x) - \lambda C(x,x)$ is likewise multiplied by $M^2$ when
the variables are submitted to a transformation of modulus M.
Hence, if a transformation

\[ x_i = \sum_{j=1}^{n} b_{ij} X_j, \qquad |b_{ij}| = M, \]

changes $\sum a_{rs} x_r x_s$, $\sum c_{rs} x_r x_s$ into $\sum \alpha_{rs} X_r X_s$, $\sum \gamma_{rs} X_r X_s$, then

\[ M^2 \begin{vmatrix} a_{11} - \lambda c_{11} & \cdots & a_{1n} - \lambda c_{1n} \\ \vdots & & \vdots \\ a_{n1} - \lambda c_{n1} & \cdots & a_{nn} - \lambda c_{nn} \end{vmatrix} = \begin{vmatrix} \alpha_{11} - \lambda\gamma_{11} & \cdots & \alpha_{1n} - \lambda\gamma_{1n} \\ \vdots & & \vdots \\ \alpha_{n1} - \lambda\gamma_{n1} & \cdots & \alpha_{nn} - \lambda\gamma_{nn} \end{vmatrix}. \]

The equation $|A - \lambda C| = 0$, or, in full,

\[ \begin{vmatrix} a_{11} - \lambda c_{11} & \cdots & a_{1n} - \lambda c_{1n} \\ \vdots & & \vdots \\ a_{n1} - \lambda c_{n1} & \cdots & a_{nn} - \lambda c_{nn} \end{vmatrix} = 0, \]

is called the $\lambda$ equation of the two forms.

What we have said above may be summarized in the theorem:

Theorem 43. The roots of the $\lambda$ equation of any two quadratic
forms in n variables are unaltered by a non-singular linear
transformation.

The coefficient of each power $\lambda^r$ (r = 0, 1,..., n) is multiplied
by the square of the modulus of the transformation.

1.2. The $\lambda$ equation of A(x, x) and the form $x_1^2 + \ldots + x_n^2$ is called
the characteristic equation of A. In full, the equation is

\[ \begin{vmatrix} a_{11} - \lambda & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} - \lambda & \cdots & a_{2n} \\ \vdots & \vdots & & \vdots \\ a_{n1} & a_{n2} & \cdots & a_{nn} - \lambda \end{vmatrix} = 0. \]

The roots of this equation are called the latent roots of the
matrix A. The equation itself may be denoted by $|A - \lambda I| = 0$,
where I is the unit matrix of order n.

The term independent of $\lambda$ is the determinant $|a_{ik}|$, so that
the characteristic equation has no zero root when $|a_{ik}| \neq 0$.
When $|a_{ik}| = 0$, so that the rank of the matrix $[a_{ik}]$ is r < n,
there is at least one zero latent root; as we shall prove later,
there are then n - r zero roots.

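2. The reality of the latent roots

2.1. Theorem 44. If $[c_{ik}]$ is the matrix of a positive-definite
form C(x, x) and $[a_{ik}]$ is any symmetrical matrix with real elements,
all the roots of the $\lambda$ equation

\[ \begin{vmatrix} a_{11} - c_{11}\lambda & a_{12} - c_{12}\lambda & \cdots & a_{1n} - c_{1n}\lambda \\ a_{21} - c_{21}\lambda & a_{22} - c_{22}\lambda & \cdots & a_{2n} - c_{2n}\lambda \\ \vdots & \vdots & & \vdots \\ a_{n1} - c_{n1}\lambda & a_{n2} - c_{n2}\lambda & \cdots & a_{nn} - c_{nn}\lambda \end{vmatrix} = 0 \]

are real.

Let $\lambda$ be any root of the equation. Then the determinant
vanishes and (Theorem 11) there are numbers $Z_1, Z_2, \ldots, Z_n$, not
all zero, such that

\[ \sum_{s=1}^{n} (a_{rs} - c_{rs}\lambda) Z_s = 0 \qquad (r = 1, \ldots, n); \]

that is,

\[ \lambda \sum_{s=1}^{n} c_{rs} Z_s = \sum_{s=1}^{n} a_{rs} Z_s \qquad (r = 1, \ldots, n). \tag{1} \]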

Multiply each equation (1) by $\bar{Z}_r$, the conjugate complex† of
$Z_r$, and add the results. If $Z_r = X_r + iY_r$, where $X_r$ and $Y_r$ are
real, we obtain terms of two distinct types, namely,

\[ Z_r \bar{Z}_r = (X_r + iY_r)(X_r - iY_r) = X_r^2 + Y_r^2, \]

\[ Z_r \bar{Z}_s + \bar{Z}_r Z_s = (X_r + iY_r)(X_s - iY_s) + (X_r - iY_r)(X_s + iY_s) = 2(X_r X_s + Y_r Y_s). \]

† When $Z = X + iY$, $\bar{Z} = X - iY$; $i = \sqrt{(-1)}$, X and Y real.

Hence the result of the operation is

\[ \lambda\Bigl\{\sum_{r=1}^{n} c_{rr}(X_r^2 + Y_r^2) + 2\sum_{r<s} c_{rs}(X_r X_s + Y_r Y_s)\Bigr\} = \sum_{r=1}^{n} a_{rr}(X_r^2 + Y_r^2) + 2\sum_{r<s} a_{rs}(X_r X_s + Y_r Y_s), \]

or, on using an obvious notation,

\[ \lambda\{C(X,X) + C(Y,Y)\} = A(X,X) + A(Y,Y). \tag{2} \]

Since, by hypothesis, C(x, x) is positive-definite and since the
numbers $Z_r = X_r + iY_r$ (r = 1,..., n) are not all zero, the coefficient
of $\lambda$ in equation (2) is positive. Moreover, since each $a_{ik}$
is real, the right-hand side of (2) is real and hence $\lambda$ must be real.

Note. If the coefficient of $\lambda$ in (2) were zero, then (2) would tell us
nothing about the reality of $\lambda$. It is to preclude this that we require
C(x, x) to be positive-definite.

Corollary. If both A(x,x) and C(x,x) are positive-definite 
forms , every root of the given equation is positive. 
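
For two variables the $\lambda$ equation is a quadratic, and the reality of its roots can be read off from its discriminant $q^2 - 4pr$. The Python sketch below is an illustration under assumptions of our own (the function names and the sample matrices are hypothetical, not the book's): it expands $|A - \lambda C|$ for symmetric 2 x 2 matrices and compares a case in which C is positive-definite with one in which it is not, the latter showing why the hypothesis on C cannot be dropped.

```python
def lambda_equation_2x2(A, C):
    """Coefficients (p, q, r) with |A - lam*C| = p*lam**2 + q*lam + r.

    A and C are symmetric 2 x 2 matrices given as nested lists.
    """
    a11, a12, a22 = A[0][0], A[0][1], A[1][1]
    c11, c12, c22 = C[0][0], C[0][1], C[1][1]
    p = c11 * c22 - c12 * c12
    q = -(a11 * c22 + a22 * c11 - 2 * a12 * c12)
    r = a11 * a22 - a12 * a12
    return p, q, r

def discriminant(p, q, r):
    """The roots of p*lam**2 + q*lam + r = 0 are real when this is not negative."""
    return q * q - 4 * p * r

# Assumed data: C1 is positive-definite (2 > 0 and 2*1 - 1*1 > 0).
A1 = [[1, 2], [2, -3]]
C1 = [[2, 1], [1, 1]]
real_case = discriminant(*lambda_equation_2x2(A1, C1))

# Assumed data: C2 (the form 2*x1*x2) is not positive-definite.
A2 = [[1, 0], [0, -1]]
C2 = [[0, 1], [1, 0]]
complex_case = discriminant(*lambda_equation_2x2(A2, C2))

print(real_case, complex_case)
```

In the second case the equation is $-\lambda^2 - 1 = 0$, whose roots are $\pm i$.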

2.2. When $c_{rr} = 1$ and $c_{rs} = 0$ ($r \neq s$), the form C(x, x) is
$x_1^2 + x_2^2 + \ldots + x_n^2$ and is positive-definite. Thus Theorem 44
contains, as a special case, the following theorem, one that has
a variety of applications in different parts of mathematics.

Theorem 45. When $[a_{ik}]$ is any symmetrical matrix with real
elements, every root $\lambda$ of the equation $|A - \lambda I| = 0$, that is, of

\[ \begin{vmatrix} a_{11} - \lambda & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} - \lambda & \cdots & a_{2n} \\ \vdots & \vdots & & \vdots \\ a_{n1} & a_{n2} & \cdots & a_{nn} - \lambda \end{vmatrix} = 0, \]

is real.

When $[a_{ik}]$ is the matrix of a positive-definite form, every root
is positive.

3. Canonical forms

3.1. In this section we prove that, if $A = [a_{ik}]$ is a square
matrix of order n and rank r, then there is a non-singular
transformation from variables $x_1, \ldots, x_n$ to variables $X_1, \ldots, X_n$
that changes A(x, x) into

\[ g_1 X_1^2 + \ldots + g_r X_r^2, \tag{3} \]

where $g_1, \ldots, g_r$ are distinct from zero. We show, further, that
if the $a_{ik}$ are elements in a particular field F (such as the field
of real numbers), then so are the coefficients of the transformation
and the resulting $g_i$. We call (3) a canonical form.

The first step in the proof is to show that any given quadratic
form can be transformed into one whose coefficient of $x_1^2$ is not
zero. This is done in §§ 3.2 and 3.3.

3.2. Elementary transformations. We shall have occa- 
sion to use two particular types of transformation. 

Type I. The transformation

\[ x_1 = X_r, \qquad x_r = X_1, \qquad x_s = X_s \quad (s \neq 1, r) \]

is non-singular; its modulus is -1, as is seen by writing the
determinant in full and interchanging the first and rth columns.
Moreover, each coefficient of the transformation is either 1 or 0,
and so belongs to every field of numbers F.

If in the quadratic form $\sum\sum a_{rs} x_r x_s$ one of the numbers
$a_{11}, \ldots, a_{nn}$ is not zero, then either $a_{11} \neq 0$ or a suitable
transformation of type I will change the quadratic form into
$\sum\sum b_{rs} X_r X_s$, wherein $b_{11} \neq 0$.

If, in the quadratic form $\sum\sum a_{rs} x_r x_s$, the numbers $a_{11}, \ldots,$
$a_{nn}$ are zero, but one number $a_{rs}$ ($r \neq s$) is not zero, then a
suitable transformation of type I will change the form into
$\sum\sum b_{rs} X_r X_s$, wherein every $b_{rr}$ is zero but one of the numbers
$b_{12}, \ldots, b_{1n}$ is not zero.

Type II. The transformation in n variables

\[ x_1 = X_1 + X_s, \qquad x_s = X_1 - X_s, \qquad x_t = X_t \quad (t \neq s, 1) \]

is non-singular. Its modulus is the determinant which

(i) in the first row, has 1 in the first and sth columns and
0 elsewhere,

(ii) in the sth row, has 1 in the first, -1 in the sth, and 0 in
every other column,

(iii) in the tth row ($t \neq s, 1$) has 1 in the principal diagonal
position and 0 elsewhere.

The value of such a determinant is -2. Moreover, each element
of the determinant is 1, 0, or -1 and so belongs to every field
of numbers F.

If, in the quadratic form $\sum\sum b_{rs} x_r x_s$, the numbers $b_{11}, \ldots,$
$b_{nn}$ are zero, but one $b_{1s}$ is not zero, then a suitable transformation
of type II will express the form as $\sum\sum c_{rs} X_r X_s$, wherein
$c_{11}$ is not zero.
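
The effect of a type II transformation is seen most simply in two variables. The Python sketch below is illustrative only: the sample form $2x_1x_2$ and the helper names are assumptions made for the purpose. With x = BX, the matrix of the transformed form is B'AB (Theorem 37), and the computation shows the zero leading coefficient replaced by a non-zero one.

```python
def transpose(M):
    return [list(col) for col in zip(*M)]

def matmul(P, Q):
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

# Assumed example: the form 2*x1*x2, in which b11 = b22 = 0 and b12 = 1.
A = [[0, 1], [1, 0]]
# Type II transformation with s = 2: x1 = X1 + X2, x2 = X1 - X2.
B = [[1, 1], [1, -1]]

C = matmul(transpose(B), matmul(A, B))        # matrix of the form in X1, X2
modulus = B[0][0] * B[1][1] - B[0][1] * B[1][0]
print(C, modulus)
```

The new form is $2X_1^2 - 2X_2^2$, so that $c_{11} = 2b_{12}$, and the modulus is -2, in agreement with the text.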

3.3. The first step towards the canonical form. The
foregoing elementary transformations enable us to transform
every quadratic form into one in which $a_{11}$ is not zero. For
consider any given form

\[ A(x,x) = \sum_{r=1}^{n} \sum_{s=1}^{n} a_{rs} x_r x_s \qquad (a_{rs} = a_{sr}). \tag{1} \]

If one $a_{rr}$ is not zero, a transformation of type I changes
(1) into a form B(X, X) in which $b_{11}$ is not zero.

If every $a_{rr}$ is zero but at least one $a_{rs}$ is not zero, a
transformation of type I followed by one of type II changes (1) into
a form C(Y, Y) in which $c_{11}$ is not zero. The product of these
two transformations changes (1) directly into C(Y, Y) and the
modulus of this transformation is ±2.

We summarize these results in a theorem. 

Theorem 46. Every quadratic form A(x, x), with coefficients
in a given field F, and having one $a_{rs}$ not zero, can be transformed
by a non-singular transformation with coefficients in F into a form
B(X, X) whose coefficient $b_{11}$ is not zero.

3.4. Proof of the main theorem. 

Definition 7. The rank of a quadratic form is defined to be 
the rank of the matrix of its coefficients. 

Theorem 47. A quadratic form in n variables and of rank r,
with coefficients in a given field F, can be transformed by a
non-singular transformation, with coefficients in F, into the form

\[ \alpha_1 X_1^2 + \ldots + \alpha_r X_r^2, \tag{1} \]

where $\alpha_1, \ldots, \alpha_r$ are numbers in F and no one of them is equal
to zero.

Let the form be $\sum_{i=1}^{n} \sum_{j=1}^{n} a_{ij} x_i x_j$, and let A denote the matrix
$[a_{ij}]$. If every $a_{ij}$ is zero, the rank of A is zero and there is
nothing to prove.

If one $a_{ij}$ is not zero (Theorem 46), there is a non-singular
transformation with coefficients in F that changes A(x, x) into

\[ \alpha_1 X_1^2 + 2\sum_{j=2}^{n} \alpha_{1j} X_1 X_j + \sum_{i=2}^{n} \sum_{j=2}^{n} \alpha_{ij} X_i X_j, \]

where $\alpha_1 \neq 0$. This may be written as

\[ \alpha_1\Bigl(X_1 + \frac{\alpha_{12}}{\alpha_1} X_2 + \ldots + \frac{\alpha_{1n}}{\alpha_1} X_n\Bigr)^2 + \sum_{i=2}^{n} \sum_{j=2}^{n} b_{ij} X_i X_j, \]

and the non-singular transformation, with coefficients in F,

\[ Y_1 = X_1 + \frac{\alpha_{12}}{\alpha_1} X_2 + \ldots + \frac{\alpha_{1n}}{\alpha_1} X_n, \qquad Y_i = X_i \quad (i = 2, \ldots, n), \]

enables us to write A(x, x) as

\[ \alpha_1 Y_1^2 + \sum_{i=2}^{n} \sum_{j=2}^{n} b_{ij} Y_i Y_j. \tag{2} \]

Moreover, the transformation direct from x to Y is the product
of the separate transformations employed; hence it is non-singular
and has its coefficients in F, and every b is in F.

If every $b_{ij}$ in (2) is zero, then (2) reduces to the form (1);
the question of rank we defer until the end.

If one $b_{ij}$ is not zero, we may change the form $\sum\sum b_{ij} Y_i Y_j$
in n - 1 variables in the same way as we have just changed the
original form in n variables. We may thus show that there is
a non-singular transformation

\[ Z_i = \sum_{j=2}^{n} l_{ij} Y_j \qquad (i = 2, \ldots, n), \tag{3} \]

with coefficients in F, which enables us to write

\[ \sum_{i=2}^{n} \sum_{j=2}^{n} b_{ij} Y_i Y_j = \alpha_2 Z_2^2 + \sum_{i=3}^{n} \sum_{j=3}^{n} c_{ij} Z_i Z_j, \]

where $\alpha_2 \neq 0$. The equations (3), together with $Y_1 = Z_1$,
constitute a non-singular transformation of the n variables $Y_1, \ldots, Y_n$
into $Z_1, \ldots, Z_n$. Hence there is a non-singular transformation,
the product of all the transformations so far employed, which
has coefficients in F and changes A(x, x) into

\[ \alpha_1 Z_1^2 + \alpha_2 Z_2^2 + \sum_{i=3}^{n} \sum_{j=3}^{n} c_{ij} Z_i Z_j, \tag{4} \]

wherein $\alpha_1 \neq 0$ and $\alpha_2 \neq 0$, and every c is in F.

On proceeding in this way, one of two things must happen.
Either we arrive at a form

\[ \alpha_1 X_1^2 + \ldots + \alpha_k X_k^2 + \sum_{i=k+1}^{n} \sum_{j=k+1}^{n} d_{ij} X_i X_j \tag{5} \]

wherein k < n, $\alpha_1 \neq 0, \ldots, \alpha_k \neq 0$, but every $d_{ij} = 0$, in which
case (5) reduces to

\[ \alpha_1 X_1^2 + \ldots + \alpha_k X_k^2 \qquad (k < n); \]

or we arrive after n steps at a form

\[ \alpha_1 X_1^2 + \ldots + \alpha_n X_n^2. \]

In either circumstance we arrive at a final form

\[ \alpha_1 X_1^2 + \ldots + \alpha_k X_k^2 \qquad (k \leq n) \tag{6} \]

by a product of transformations each of which is non-singular
and has its coefficients in the given field F.

It remains to prove that the number k in (6) is equal to r,
the rank of the matrix A. Let B denote the transformation
whereby we pass from A(x, x) to the form

\[ \alpha_1 X_1^2 + \ldots + \alpha_k X_k^2 + \sum_{i=k+1}^{n} 0 \cdot X_i^2. \tag{7} \]

Then the matrix of (7) is (Theorem 37) B'AB. Since B is
non-singular, the matrix B'AB has the same rank as A (Theorem
34), that is, r. But the matrix of the quadratic form (7) consists
of $\alpha_1, \ldots, \alpha_k$ in the first k places of the principal diagonal
and zero elsewhere, so that its rank is k. Hence k = r.
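
The reduction of Theorem 47 can be carried out by exact arithmetic in the field of rational numbers. The Python sketch below is an illustration under assumptions of our own and is not the book's procedure in every detail: the function names are hypothetical, and where every remaining diagonal coefficient vanishes it replaces one variable by the sum of two (a close variant of the type II step of § 3.2, chosen because it is shorter to code). Each operation is applied to rows and then to the corresponding columns, so that the matrix is always of the form B'AB for a non-singular B.

```python
from fractions import Fraction

def swap_rc(M, i, j):
    """Interchange variables i and j (a type I step): swap rows, then columns."""
    M[i], M[j] = M[j], M[i]
    for row in M:
        row[i], row[j] = row[j], row[i]

def add_rc(M, i, j, f):
    """Add f times row j to row i, then f times column j to column i."""
    n = len(M)
    for c in range(n):
        M[i][c] += f * M[j][c]
    for r in range(n):
        M[r][i] += f * M[r][j]

def canonical_coefficients(A):
    """Diagonal coefficients of a canonical form of the symmetric matrix A."""
    n = len(A)
    M = [[Fraction(v) for v in row] for row in A]
    for k in range(n):
        p = next((i for i in range(k, n) if M[i][i] != 0), None)
        if p is None:
            pair = next(((i, j) for i in range(k, n)
                         for j in range(i + 1, n) if M[i][j] != 0), None)
            if pair is None:
                break                       # the remaining form is identically zero
            i, j = pair
            add_rc(M, i, j, Fraction(1))    # new M[i][i] is 2*M[i][j], not zero
            p = i
        if p != k:
            swap_rc(M, k, p)
        for i in range(k + 1, n):
            add_rc(M, i, k, -M[i][k] / M[k][k])   # complete the square
    return [M[i][i] for i in range(n)]

# Assumed examples: 2(x1*x2 + x1*x3 + x2*x3), of rank 3, and (x1 + 2*x2)^2, of rank 1.
d1 = canonical_coefficients([[0, 1, 1], [1, 0, 1], [1, 1, 0]])
d2 = canonical_coefficients([[1, 2], [2, 4]])
print(d1, d2)
```

The number of non-zero coefficients returned is the rank of the form, as the last paragraph of the proof requires.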

Corollary. If A(x, x) is a quadratic form in n variables, with
its coefficients in a given field F, and if its discriminant is zero,
it can be transformed by a non-singular transformation, with
coefficients in F, into a form (in n - 1 variables at most)

\[ \alpha_1 X_1^2 + \ldots + \alpha_{n-1} X_{n-1}^2, \]

where $\alpha_1, \ldots, \alpha_{n-1}$ are numbers in F.

This corollary is, of course, merely a partial statement of the
theorem itself; since the discriminant of A(x, x) is zero, the rank
of the form is n - 1 or less. We state the corollary with a view
to its immediate use in the next section, wherein F is the field
of real numbers.

4. The simultaneous reduction of two real quadratic forms

Theorem 48. Let A(x, x), C(x, x) be two real quadratic forms
in n variables and let C(x, x) be positive-definite. Then there is
a real, non-singular transformation that expresses the two forms as

\[ \lambda_1 X_1^2 + \ldots + \lambda_n X_n^2, \qquad X_1^2 + \ldots + X_n^2, \]

where $\lambda_1, \ldots, \lambda_n$ are the roots of $|A - \lambda C| = 0$ and are all real.

The roots of $|A - \lambda C| = 0$ are all real (Theorem 44). Let $\lambda_1$
be any one root. Then $A(x,x) - \lambda_1 C(x,x)$ is a real quadratic
form whose discriminant is zero and so (Theorem 47, Corollary)
there is a real non-singular transformation from $x_1, \ldots, x_n$ to
$Y_1, \ldots, Y_n$ such that

\[ A(x,x) - \lambda_1 C(x,x) = \alpha_2 Y_2^2 + \ldots + \alpha_n Y_n^2, \]

where the $\alpha$'s are real numbers. Let this same transformation,
when applied to C(x, x), give

\[ C(x,x) = \sum_{i=1}^{n} \sum_{j=1}^{n} \gamma_{ij} Y_i Y_j. \tag{1} \]

Then we have, for an arbitrary $\lambda$,

\[ \begin{aligned} A(x,x) - \lambda C(x,x) &= A(x,x) - \lambda_1 C(x,x) + (\lambda_1 - \lambda) C(x,x) \\ &= \sum_{i=2}^{n} \alpha_i Y_i^2 + (\lambda_1 - \lambda) \sum_{i=1}^{n} \sum_{j=1}^{n} \gamma_{ij} Y_i Y_j. \end{aligned} \tag{2} \]

Since C(x, x) is positive-definite in the variables x, it is also
positive-definite in the variables Y (Theorem 40). Hence $\gamma_{11}$ is
positive and we may use the transformation

\[ Z_1 = Y_1 + \frac{\gamma_{12}}{\gamma_{11}} Y_2 + \ldots + \frac{\gamma_{1n}}{\gamma_{11}} Y_n, \qquad Z_i = Y_i \quad (i = 2, \ldots, n). \]

This enables us to write (2) in the form (Chap. XI, § 1.2)

\[ A(x,x) - \lambda C(x,x) = (\lambda_1 - \lambda)\gamma_{11} Z_1^2 + \phi + (\lambda_1 - \lambda)\psi, \]

where $\phi$ and $\psi$ are real quadratic forms in $Z_2, \ldots, Z_n$.
Since this holds for an arbitrary value of $\lambda$, we have

\[ \begin{aligned} A(x,x) &= \lambda_1 \gamma_{11} Z_1^2 + \theta, \\ C(x,x) &= \gamma_{11} Z_1^2 + \psi, \end{aligned} \tag{3} \]

where $\theta$ and $\psi$ denote quadratic forms in $Z_2, \ldots, Z_n$.

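This is the first step towards the forms we wish to establish.
Before we proceed to the next step we observe two things:

(i) $\psi$ is a positive-definite form in the n - 1 variables $Z_2, \ldots, Z_n$;
for

\[ \gamma_{11} Z_1^2 + \psi(Z_2, \ldots, Z_n) \]

is obtained by a non-singular transformation from the form (1),
which is positive-definite in $Y_1, \ldots, Y_n$.

(ii) The roots of $|\theta - \lambda\psi| = 0$, together with $\lambda = \lambda_1$, account
for all the roots of $|A - \lambda C| = 0$; for the forms (3) derive from
A(x, x) and C(x, x) by a non-singular transformation and so
(Theorem 43) the roots of

\[ |(\lambda_1\gamma_{11} - \lambda\gamma_{11}) Z_1^2 + \theta - \lambda\psi| = 0, \]

that is, of

\[ \begin{vmatrix} \gamma_{11}(\lambda_1 - \lambda) & 0 & \cdots & 0 \\ 0 & \theta_{22} - \lambda\psi_{22} & \cdots & \theta_{2n} - \lambda\psi_{2n} \\ \vdots & \vdots & & \vdots \\ 0 & \theta_{n2} - \lambda\psi_{n2} & \cdots & \theta_{nn} - \lambda\psi_{nn} \end{vmatrix} = 0, \]

are the roots of $|A - \lambda C| = 0$. If $\lambda_1$ is a repeated root of
$|A - \lambda C| = 0$, then $\lambda_1$ is also a root of $|\theta - \lambda\psi| = 0$.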

Thus we may, by using the first step with $\theta$ and $\psi$ in place
of A and C, reduce $\theta$, $\psi$ to forms

\[ \begin{aligned} \theta &= \lambda_2 \alpha_2 U_2^2 + f(U_3, \ldots, U_n), \\ \psi &= \alpha_2 U_2^2 + F(U_3, \ldots, U_n), \end{aligned} \tag{4} \]

where $\alpha_2$ is positive, $\lambda_2$ is a root of $|A - \lambda C| = 0$, and the
transformation between $Z_2, \ldots, Z_n$ and $U_2, \ldots, U_n$ is real and non-singular.
When we adjoin the equation $U_1 = Z_1$ we have a real,
non-singular transformation between $Z_1, \ldots, Z_n$ and $U_1, \ldots, U_n$. Hence
there is a real non-singular transformation (the product of all
transformations so far used) from $x_1, \ldots, x_n$ to $U_1, \ldots, U_n$ such that

\[ \begin{aligned} A(x,x) &= \lambda_1 \alpha_1 U_1^2 + \lambda_2 \alpha_2 U_2^2 + f(U_3, \ldots, U_n), \\ C(x,x) &= \alpha_1 U_1^2 + \alpha_2 U_2^2 + F(U_3, \ldots, U_n), \end{aligned} \tag{5} \]

where $\alpha_1$ and $\alpha_2$ are positive; $\lambda_1$ and $\lambda_2$ are roots, not necessarily
distinct, of $|A - \lambda C| = 0$; f and F are real quadratic forms. As at
the first stage, we can show that F is positive-definite and that
the roots of $|f - \lambda F| = 0$ together with $\lambda_1$ and $\lambda_2$ account for
all the roots of $|A - \lambda C| = 0$.

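Thus we may proceed step by step to the reduction of A(x, x)
and C(x, x) by real non-singular transformations to the two
forms

\[ \begin{aligned} A(x,x) &= \lambda_1 \alpha_1 Y_1^2 + \lambda_2 \alpha_2 Y_2^2 + \ldots + \lambda_n \alpha_n Y_n^2, \\ C(x,x) &= \alpha_1 Y_1^2 + \alpha_2 Y_2^2 + \ldots + \alpha_n Y_n^2, \end{aligned} \]

wherein each $\alpha_i$ is positive and $\lambda_1, \ldots, \lambda_n$ account for all the roots
of $|A - \lambda C| = 0$.

Finally, the real transformation

\[ X_r = \sqrt{\alpha_r}\, Y_r \]

gives the required result, namely,

\[ A(x,x) = \lambda_1 X_1^2 + \ldots + \lambda_n X_n^2, \qquad C(x,x) = X_1^2 + \ldots + X_n^2. \]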

5. Orthogonal transformations 

If, in Theorem 48, the positive-definite form is $x_1^2 + \ldots + x_n^2$,
the transformation envisaged by the theorem transforms
$x_1^2 + \ldots + x_n^2$ into $X_1^2 + \ldots + X_n^2$. Such a transformation is called
an orthogonal transformation. We shall examine such
transformations in Chapter XIII; we shall see that they are
necessarily non-singular. Meanwhile, we note an important
theorem.

Theorem 49. A real quadratic form A(x, x) in n variables can
be reduced by a real orthogonal transformation to the form

\[ \lambda_1 X_1^2 + \ldots + \lambda_n X_n^2, \]

where $\lambda_1, \ldots, \lambda_n$ account for all the roots of $|A - \lambda I| = 0$.
Moreover, all the roots are real.

The proof consists in writing $x_1^2 + \ldots + x_n^2$ for C(x, x) in
Theorem 48. [See also Chapter XV, § II.]

6. The number of non-zero latent roots

If B is the matrix of the orthogonal transformation whereby
A(x, x) is reduced to the form

\[ \lambda_1 X_1^2 + \ldots + \lambda_n X_n^2, \tag{1} \]

then (Theorem 37) B'AB is the matrix of the form (1) and
this has $\lambda_1, \ldots, \lambda_n$ in its leading diagonal and zero elsewhere. Its
rank is the number of $\lambda$'s that are not zero. But since B is
non-singular, the rank of B'AB is equal to the rank of A.
Hence the rank of A is equal to the number of non-zero roots of
the characteristic equation $|A - \lambda I| = 0$.
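
The result just proved may be checked on small symmetric matrices by writing down the characteristic equation and counting its zero roots. The Python sketch below is illustrative only; the function names and sample matrices are assumptions. It obtains the coefficients of $|\lambda I - A|$ (which has the same roots as $|A - \lambda I|$) by the Faddeev-LeVerrier recurrence, a standard method not used in the text, and counts the zero roots as the number of vanishing coefficients at the low-order end.

```python
from fractions import Fraction

def characteristic_coefficients(A):
    """Coefficients [1, c_{n-1}, ..., c_0] of |lam*I - A|, highest power first."""
    n = len(A)
    A = [[Fraction(v) for v in row] for row in A]
    M = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    coeffs = [Fraction(1)]
    for k in range(1, n + 1):
        AM = [[sum(A[i][t] * M[t][j] for t in range(n)) for j in range(n)]
              for i in range(n)]
        c = -sum(AM[i][i] for i in range(n)) / k      # minus the trace, over k
        coeffs.append(c)
        M = [[AM[i][j] + (c if i == j else 0) for j in range(n)]
             for i in range(n)]
    return coeffs

def nonzero_root_count(A):
    """Number of non-zero latent roots, counted with multiplicity."""
    zero_roots = 0
    for c in reversed(characteristic_coefficients(A)):
        if c != 0:
            break
        zero_roots += 1
    return len(A) - zero_roots

# Assumed examples: the first two are of rank 1, the third of rank 3.
S1 = [[1, 2], [2, 4]]
S2 = [[1, 1, 0], [1, 1, 0], [0, 0, 0]]
S3 = [[2, 1, 0], [1, 3, 1], [0, 1, 2]]
print(nonzero_root_count(S1), nonzero_root_count(S2), nonzero_root_count(S3))
```

In each case the count agrees with the rank of the matrix, which for these samples may be seen by inspection.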

7. The signature of a quadratic form 

7.1. As we proved in Theorem 47 (§3.4), a real quadratic 
form of rank r can be transformed by a real non-singulaV trans- 
formation into the form 

ai Xl+...+cc r Xl ( 1 ) 

wherein a x ,..., a r are real and not zero. 

As a glance at §3.3 will show, there are, in general, many 
different ways of effecting such a reduction; even at the first 
step we have a wide choice as to which non-zero a rs we select 
to become the non-zero b n or c n , as the case may be. 

The theorem we shall now prove establishes the fact that,
starting from the one given form $A(x,x)$, the number of positive
$\alpha$'s and the number of negative $\alpha$'s in (1) is independent of the
method of reduction.

Theorem 50. If a given real quadratic form of rank r is
reduced by two real, non-singular transformations, $B_1$ and $B_2$ say,
to the forms

\[ \alpha_1 X_1^2 + \dots + \alpha_r X_r^2, \tag{2} \]

\[ \beta_1 Y_1^2 + \dots + \beta_r Y_r^2, \tag{3} \]

the number of positive $\alpha$'s is equal to the number of positive $\beta$'s and
the number of negative $\alpha$'s is equal to the number of negative $\beta$'s.

Let n be the number of variables $x_1, \dots, x_n$ in the initial
quadratic form; let $\mu$ be the number of positive $\alpha$'s and $\nu$ the number
of positive $\beta$'s. Let the variables X, Y be so numbered that
the positive $\alpha$'s and $\beta$'s come first. Then, since (2) and (3) are
transformations of the same initial form, we have

\[ \alpha_1 X_1^2 + \dots + \alpha_\mu X_\mu^2 - |\alpha_{\mu+1}| X_{\mu+1}^2 - \dots - |\alpha_r| X_r^2
   \equiv \beta_1 Y_1^2 + \dots + \beta_\nu Y_\nu^2 - |\beta_{\nu+1}| Y_{\nu+1}^2 - \dots - |\beta_r| Y_r^2. \tag{4} \]

Now suppose, contrary to the theorem, that $\mu > \nu$. Then
the $n + \nu - \mu$ equations

\[ Y_1 = 0, \ \dots, \ Y_\nu = 0, \quad X_{\mu+1} = 0, \ \dots, \ X_n = 0 \tag{5} \]

are homogeneous equations in the n variables† $x_1, \dots, x_n$. There
are fewer equations than there are variables and so (Chap. VIII,
§5) the equations have a solution $x_1 = \xi_1, \dots, x_n = \xi_n$ in which
$\xi_1, \dots, \xi_n$ are not all zero.

Let $X'_r$, $Y'_r$ be the values of $X_r$, $Y_r$ when $x = \xi$. Then, from
(4) and (5),

\[ \alpha_1 X_1'^2 + \dots + \alpha_\mu X_\mu'^2 = -|\beta_{\nu+1}| Y_{\nu+1}'^2 - \dots - |\beta_r| Y_r'^2, \]

which is impossible unless each $X'$ and $Y'$ is zero.

Hence either we have a contradiction or

\[ X'_1 = 0, \ \dots, \ X'_\mu = 0, \]

and, from (5),

\[ X'_{\mu+1} = 0, \ \dots, \ X'_n = 0. \tag{6} \]

But (6) means that the n equations

\[ X_1 = 0, \ \dots, \ X_n = 0, \tag{7} \]

say

\[ \sum_{k=1}^{n} l_{ik} x_k = 0 \qquad (i = 1, \dots, n) \]

in full, have a solution $x_r = \xi_r$ in which $\xi_1, \dots, \xi_n$ are not all
zero; this, in turn, means that the determinant $|l_{ik}| = 0$, which
is a contradiction of the hypothesis that the transformation

\[ X_i = \sum_{k=1}^{n} l_{ik} x_k \qquad (i = 1, \dots, n) \]

is non-singular.

Hence the assumption that $\mu > \nu$ leads to a contradiction.
Similarly, the assumption that $\nu > \mu$ leads to a contradiction.
Accordingly, $\mu = \nu$ and the theorem is proved.

7.2. One of the ways of reducing a form of rank r is by the
orthogonal transformation of Theorem 49. This gives the form

\[ \lambda_1 X_1^2 + \dots + \lambda_r X_r^2, \]

where $\lambda_1, \dots, \lambda_r$ are the non-zero roots of the characteristic
equation.

Hence the number of positive $\alpha$'s, or $\beta$'s, in Theorem 50 is
the number of positive latent roots of the form.

† Each X and Y is a linear form in $x_1, \dots, x_n$.
7.3. We have proved that, associated with every quadratic
form are two numbers P and N, the number of positive and
of negative coefficients in any canonical form. The sum of P
and N is the rank of the form.

Definition 8. The number $P - N$ is called the signature of
the form.
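
The independence asserted by Theorem 50 can be checked on a particular form. The sketch below is an illustration only: it reduces the form of Example 4 at the end of this chapter, $4x_1^2 + 9x_2^2 + 2x_3^2 + 8x_2x_3 + 6x_3x_1 + 6x_1x_2$, to a sum of squares in two ways, once with the variables in the order $x_1, x_2, x_3$ and once in the order $x_3, x_2, x_1$. The function names are hypothetical, and the elimination assumes that every pivot met on the way is non-zero.

```python
from fractions import Fraction

def diagonal_coefficients(matrix):
    """Coefficients of the squares obtained by eliminating the variables in turn.

    Assumption of this sketch: each pivot is non-zero, so no renumbering
    of the variables is needed during the reduction.
    """
    a = [[Fraction(v) for v in row] for row in matrix]
    n = len(a)
    coefficients = []
    for k in range(n):
        pivot = a[k][k]
        if pivot == 0:
            raise ValueError("zero pivot: renumber the variables first")
        coefficients.append(pivot)
        for i in range(k + 1, n):
            factor = a[i][k] / pivot
            for j in range(k + 1, n):
                a[i][j] -= factor * a[k][j]
    return coefficients

def signature(coefficients):
    """Return P - N for a list of non-zero coefficients."""
    positive = sum(1 for c in coefficients if c > 0)
    negative = sum(1 for c in coefficients if c < 0)
    return positive - negative

# Matrix of the form with the variables taken in the order x_1, x_2, x_3.
first = diagonal_coefficients([[4, 3, 3], [3, 9, 4], [3, 4, 2]])

# The same form with the variables taken in the order x_3, x_2, x_1.
second = diagonal_coefficients([[2, 4, 3], [4, 9, 3], [3, 3, 4]])
```

The two reductions give different coefficients, but each list contains two positive numbers and one negative number, so that P = 2, N = 1 and the signature is 1 in both cases.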

7.4. We conclude with a theorem which states that any two
quadratic forms having the same rank and signature are, in
a certain sense, equivalent.

Theorem 51. Let $A_1(x,x)$, $A_2(y,y)$ be two real quadratic forms
having the same rank r and the same signature s. Then there is
a real non-singular transformation $x = By$ that transforms
$A_1(x,x)$ into $A_2(y,y)$.

When $A_1(x,x)$ is reduced to its canonical form it becomes

\[ \alpha_1 X_1^2 + \dots + \alpha_\mu X_\mu^2 - \beta_{\mu+1} X_{\mu+1}^2 - \dots - \beta_r X_r^2, \tag{1} \]

where the $\alpha$'s and $\beta$'s are positive, where $\mu = \tfrac12(s+r)$, and
where the transformation from x to X is real and non-singular.
The real transformation

\[ \xi_1 = X_1\sqrt{\alpha_1}, \ \dots, \ \xi_\mu = X_\mu\sqrt{\alpha_\mu}, \]

\[ \xi_{\mu+1} = X_{\mu+1}\sqrt{\beta_{\mu+1}}, \ \dots, \ \xi_r = X_r\sqrt{\beta_r} \]

changes (1) to

\[ \xi_1^2 + \dots + \xi_\mu^2 - \xi_{\mu+1}^2 - \dots - \xi_r^2. \tag{2} \]

There is, then, a real non-singular transformation, say
$x = C_1\xi$, that changes $A_1(x,x)$ into (2).

Equally, there is a real non-singular transformation, say
$y = C_2\xi$, that changes $A_2(y,y)$ into (2). Or, on considering the
reciprocal process, (2) is changed into $A_2(y,y)$ by the
transformation $\xi = C_2^{-1}y$. Hence

\[ x = C_1 C_2^{-1} y \]

changes $A_1(x,x)$ into $A_2(y,y)$.

Examples XII

1. Two forms $A(x,x)$ and $C(x,x)$ have

\[ a_{rs} = c_{rs} \quad \text{when } r = 1, \dots, k \text{ and } s = 1, \dots, n. \]

Prove that the $\lambda$ equation of the two forms has k roots equal to unity.
2. Write down the $\lambda$ equation of the two forms $ax^2 + 2hxy + by^2$ and
$a'x^2 + 2h'xy + b'y^2$ and prove, by elementary methods, that the roots
are real when $a > 0$ and $ab - h^2 > 0$.

Hint. Let

\[ ax^2 + 2hxy + by^2 = a(x - \alpha_1 y)(x - \beta_1 y), \]
\[ a'x^2 + 2h'xy + b'y^2 = a'(x - \alpha_2 y)(x - \beta_2 y). \]

The condition for real roots in $\lambda$ becomes

\[ (ab' + a'b - 2hh')^2 - 4(ab - h^2)(a'b' - h'^2) \geq 0, \]

which can be written as

\[ a^2 a'^2 (\alpha_1 - \alpha_2)(\beta_1 - \beta_2)(\alpha_1 - \beta_2)(\beta_1 - \alpha_2) \geq 0. \]

When $ab - h^2 > 0$, $\alpha_1$ and $\beta_1$ are conjugate complexes and the result can
be proved by writing $\alpha_1 = \gamma + i\delta$, $\beta_1 = \gamma - i\delta$.

In fact, the $\lambda$ roots are real save when $\alpha_1, \beta_1, \alpha_2, \beta_2$ are real and the
roots with suffix 1 separate those with suffix 2.

3. Prove that the latent roots of the matrix A, where

\[ A(x,x) = 6x_1^2 + 35x_2^2 + 11x_3^2 + 34x_2x_3, \]

are all positive.

4. Prove that when

\[ A(x,x) = 4x_1^2 + 9x_2^2 + 2x_3^2 + 8x_2x_3 + 6x_3x_1 + 6x_1x_2 \]

two latent roots are positive and one is negative.

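5. $A \equiv \sum\sum a_{rs} x_r x_s$, $\quad X_r = \tfrac12\, \partial A/\partial x_r$;

\[ A_k = \begin{vmatrix} a_{11} & \dots & a_{1k} & X_1 \\ \vdots & & \vdots & \vdots \\ a_{k1} & \dots & a_{kk} & X_k \\ X_1 & \dots & X_k & A \end{vmatrix}, \qquad
   f_k = \begin{vmatrix} a_{11} & \dots & a_{1k} \\ \vdots & & \vdots \\ a_{k-1,1} & \dots & a_{k-1,k} \\ X_1 & \dots & X_k \end{vmatrix}. \]

By subtracting from the last row multiples of the other rows and then,
for $A_k$, by subtracting from the last column multiples of the other
columns, prove that $A_k$ and $f_k$ are independent of the variables $x_1, \dots, x_k$
when $k < n$. Prove also that $A_n \equiv 0$.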

6. With the notation of Example 5, and with

\[ D_k = \begin{vmatrix} a_{11} & \dots & a_{1k} \\ \vdots & & \vdots \\ a_{k1} & \dots & a_{kk} \end{vmatrix}, \]

show, by means of Theorem 18 applied to $A_k$, that

f it I = -Djt-i -4 fc . 

f* 1>J 

7. Use the result of Example 6 and the result $A_n \equiv 0$ of Example 5 to
prove that, when no $D_k$ is zero,

n 

- ' . a = V_JL . 

Zx^iV 
Consider what happens when the rank, r, of A is less than n and
$D_1, \dots, D_r$ are all distinct from zero.

8. Transform $4x_2x_3 + 2x_3x_1 + 6x_1x_2$ into a form $\sum\sum a_{rs} X_r X_s$ with
$a_{11} \neq 0$, and hence (by the method of Chap. XI, § 1.2) find a canonical
form and the signature of the form.

9. Show, by means of Example 4, that

\[ 4x_1^2 + 9x_2^2 + 2x_3^2 + 8x_2x_3 + 6x_3x_1 + 6x_1x_2 \]

is of rank 3 and signature 1. Verify Theorem 50 for any two independent
reductions of this form to a canonical form.

10. Prove that a quadratic form is the product of two linear factors if 
and only if its rank does not exceed 2. 

Hint. Use Theorem 47. 

11. Prove that the discriminant of a Hermitian form $A(x,\bar{x})$, when it
is transformed to new variables by means of transformations $x = BX$,
$\bar{x} = \bar{B}\bar{X}$, is multiplied by $|B| \times |\bar{B}|$.

Deduce the analogue of Theorems 43 and 44 for Hermitian forms.

12. Prove that, if A is a Hermitian matrix, then all the roots of
$|A - \lambda I| = 0$ are real.

Hint. Compare Theorem 45 and use

\[ C(x,\bar{x}) = \sum_{r=1}^{n} x_r \bar{x}_r. \]

13. Prove the analogues of Theorems 46 and 47 for Hermitian forms. 

14. $A(x,\bar{x})$, $C(x,\bar{x})$ are Hermitian forms, of which C is
positive-definite. Prove that there is a non-singular transformation that expresses
the two forms as

\[ \lambda_1 X_1\bar{X}_1 + \dots + \lambda_n X_n\bar{X}_n, \qquad X_1\bar{X}_1 + \dots + X_n\bar{X}_n, \]

where $\lambda_1, \dots, \lambda_n$ are the roots of $|A - \lambda C| = 0$ and are all real.

15. An example of some importance in analytical dynamics.† Show
that when a quadratic form in $m+n$ variables,

\[ 2T = \sum\sum a_{rs} x_r x_s \qquad (a_{rs} = a_{sr}), \]

is expressed in terms of $\xi_1, \dots, \xi_m, x_{m+1}, \dots, x_{m+n}$, where $\xi_r = \partial T/\partial x_r$, there
are no terms involving the product of a $\xi$ by an $x$.

Solution. The result is easily proved by careful manipulation. Use
the summation convention: let the range of r and s be $1, \dots, m$; let the
range of u and t be $m+1, \dots, m+n$. Then 2T may be expressed as

\[ 2T = a_{rs} x_r x_s + 2a_{ru} x_r y_u + a_{ut} y_u y_t, \]

where, as an additional distinguishing mark, we have written y instead
of x whenever the suffix exceeds m.

We are to express T in terms of the y's and new variables $\xi$, given by

\[ \xi_s = a_{sr} x_r + a_{su} y_u \qquad (s = 1, \dots, m). \]

Multiply by $A_{sr}$, the co-factor of $a_{sr}$ in $\Delta = |a_{rs}|$, a determinant of order
m, and add: we get

\[ \Delta x_r = A_{sr}\xi_s - A_{sr} a_{su} y_u. \]

Now

\[ 2T = x_r(a_{rs} x_s + 2a_{ru} y_u) + a_{ut} y_u y_t = x_r(\xi_r + a_{ru} y_u) + a_{ut} y_u y_t, \]

so that

\[ 2\Delta T = (A_{sr}\xi_s - A_{sr} a_{su} y_u)(\xi_r + a_{ru} y_u) + \Delta a_{ut} y_u y_t
   = A_{sr}\xi_s\xi_r + y_u(a_{ru} A_{sr}\xi_s - a_{su} A_{sr}\xi_r) + \phi(y,y), \]

where $\phi(y,y)$ denotes a quadratic form in the y's only.

But, since $a_{rs} = a_{sr}$, we also have $A_{sr} = A_{rs}$. Thus the term
multiplying $y_u$ in the above may be written as

\[ a_{ru} A_{rs}\xi_s - a_{su} A_{sr}\xi_r, \]

and this is zero, since both r and s are dummy suffixes. Hence, 2T may
be written in the form

\[ 2T = (A_{sr}/\Delta)\,\xi_s\xi_r + b_{ut} y_u y_t, \]

where r, s run from 1 to m and u, t run from $m+1$ to $m+n$.

† Cf. Lamb, Higher Mechanics, §77: The Routhian function. I owe this
example to Mr. J. Hodgkinson.



CHAPTER XIII 


ORTHOGONAL TRANSFORMATIONS 

1. Definition and elementary properties 

1.1. We recall the definition of the previous chapter.

Definition 9. A transformation $x = AX$ that transforms
$x_1^2 + \dots + x_n^2$ into $X_1^2 + \dots + X_n^2$ is called an orthogonal
transformation. The matrix A is called an orthogonal matrix.

The best-known example of such a transformation occurs in 
analytical geometry. When (x, y, z) are the coordinates of a 
point P referred to rectangular axes Ox, Oy, Oz and (X, Y, Z) 
are its coordinates referred to rectangular axes OX, OY, OZ, 
whose direction-cosines with regard to the former axes are 
$(l_1, m_1, n_1)$, $(l_2, m_2, n_2)$, $(l_3, m_3, n_3)$, the two sets of coordinates
are connected by the equations

\[ x = l_1 X + l_2 Y + l_3 Z, \]
\[ y = m_1 X + m_2 Y + m_3 Z, \]
\[ z = n_1 X + n_2 Y + n_3 Z. \]

Moreover, $x^2 + y^2 + z^2 = X^2 + Y^2 + Z^2 = OP^2$.

1.2. The matrix of an orthogonal transformation must have 
some special property. This is readily obtained. 

If $A = [a_{rs}]$ is an orthogonal matrix, and $x = AX$,

\[ \sum_{r=1}^{n} X_r^2 = \sum_{r=1}^{n} x_r^2 = \sum_{r=1}^{n} (a_{r1}X_1 + \dots + a_{rn}X_n)^2 \]

for every set of values of the variables $X_r$. Hence

\[ a_{1s}^2 + \dots + a_{ns}^2 = 1 \quad (s = 1, \dots, n), \qquad
   a_{1s}a_{1t} + \dots + a_{ns}a_{nt} = 0 \quad (s \neq t). \tag{1} \]

These are the relations that mark an orthogonal matrix: they
are equivalent to the matrix equation

\[ A'A = I, \tag{2} \]

as is seen by forming the matrix product $A'A$.





1.3. When (2) is satisfied $A'$ is the reciprocal of A (Theorem
25), so that $AA'$ is also equal to the unit matrix, and from this
fact follow the relations (by writing $AA'$ in full)

\[ a_{s1}^2 + \dots + a_{sn}^2 = 1 \quad (s = 1, \dots, n), \qquad
   a_{s1}a_{t1} + \dots + a_{sn}a_{tn} = 0 \quad (s \neq t). \tag{3} \]

When n = 3 and the transformation is the change of axes
noted in §1.1, the relations (1) and (3) take the well-known
forms

\[ l_1^2 + l_2^2 + l_3^2 = 1, \qquad l_1m_1 + l_2m_2 + l_3m_3 = 0, \]
\[ l_1^2 + m_1^2 + n_1^2 = 1, \qquad l_1l_2 + m_1m_2 + n_1n_2 = 0, \]

and so on.

1.4. We now give four theorems that embody important 
properties of an orthogonal matrix. 

Theorem 52. A necessary and sufficient condition for a square 
matrix A to be orthogonal is A A' = I. 

This theorem follows at once from the work of §§ 1.2, 1.3. 
Corollary. Every orthogonal transformation is non-singular. 

Theorem 53. The product of two orthogonal transformations 
is an orthogonal transformation. 

Let x = AX, X = BY be orthogonal transformations. Then 
AA' = I, BB' = I. 

Hence

\[ (AB)(AB)' = ABB'A' \quad \text{(Theorem 23)} \quad = AIA' = AA' = I, \]

and the theorem is proved. 

Theorem 54. The modulus of an orthogonal transformation is
either +1 or -1.

If $AA' = I$ and $|A|$ is the determinant of the matrix A, then
$|A| \cdot |A'| = 1$. But $|A'| = |A|$, and hence $|A|^2 = 1$.
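
Theorems 52 to 54 are easily verified on particular matrices. In the sketch below the two matrices are illustrative choices with rational elements (a rotation built on the numbers 3, 4, 5 and a reflection built on 5, 12, 13), so that the arithmetic is exact; the helper names are not from the text.

```python
from fractions import Fraction as F

def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(row) for row in zip(*m)]

def matmul(p, q):
    """Return the matrix product p q."""
    return [[sum(p[i][k] * q[k][j] for k in range(len(q)))
             for j in range(len(q[0]))] for i in range(len(p))]

def det2(m):
    """Determinant of a matrix of order 2."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def is_orthogonal(m):
    """Test the condition AA' = I of Theorem 52 (order 2 only)."""
    return matmul(m, transpose(m)) == [[1, 0], [0, 1]]

A = [[F(3, 5), F(-4, 5)], [F(4, 5), F(3, 5)]]        # modulus +1
B = [[F(5, 13), F(12, 13)], [F(12, 13), F(-5, 13)]]  # modulus -1
AB = matmul(A, B)                                    # product transformation
```

Here `is_orthogonal` holds for `A`, `B` and `AB`, in accordance with Theorem 53, and the three moduli `det2(A)`, `det2(B)`, `det2(AB)` are +1, -1, -1, in accordance with Theorem 54.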

Theorem 55. If $\lambda$ is a latent root of an orthogonal
transformation, then so is $1/\lambda$.

Let A be an orthogonal matrix; then $AA' = I$, and so

\[ A' = A^{-1}. \tag{4} \]

By Theorem 36, Corollary, the characteristic (latent) roots
of $A'$ are the reciprocals of those of A. But the characteristic
equation of $A'$ is a determinant which, on interchanging its
rows and columns, becomes the characteristic equation of A.

Hence the n latent roots of A are the reciprocals of the n
latent roots of A, and the theorem follows. (An alternative
proof is given in Example 6, p. 168.)


2. The standard form of an orthogonal matrix 
2.1. In order that A may be an orthogonal matrix the
equations (1) of § 1.2 must be satisfied. There are

\[ \tfrac12 n(n+1) \]

of these equations. There are $n^2$ elements in A. We may
therefore expect† that the number of independent constants
necessary to define A completely will be

\[ n^2 - \tfrac12 n(n+1) = \tfrac12 n(n-1). \]

If the general orthogonal matrix of order n is to be expressed
in terms of some other type of matrix, we must look for a matrix
that has $\tfrac12 n(n-1)$ independent elements. Such a matrix is the
skew-symmetric matrix of order n; that is,

\[ S = [b_{ik}], \qquad b_{ik} = -b_{ki}. \]

For example, when n = 3,

\[ \begin{bmatrix} 0 & b_{12} & b_{13} \\ -b_{12} & 0 & b_{23} \\ -b_{13} & -b_{23} & 0 \end{bmatrix} \]

has 1+2 = 3 independent elements, namely, those lying above
the leading diagonal. The number of such elements in the
general case is

\[ 1 + 2 + \dots + (n-1) = \tfrac12 n(n-1). \]

† The method of counting constants indicates what results to expect; it
rarely proves those results, and in the crude form we have used above it
certainly proves nothing.
Theorem 56. If S is a skew-symmetric matrix of order n, then,
provided that $I+S$ is non-singular,

\[ O = (I-S)(I+S)^{-1} \]

is an orthogonal matrix of order n.

Since S is skew-symmetric,

\[ S' = -S, \qquad (I-S)' = I+S, \qquad (I+S)' = I-S; \]

and when $I+S$ is non-singular it has a reciprocal $(I+S)^{-1}$.
Hence the non-singular matrix O being defined by

\[ O = (I-S)(I+S)^{-1}, \]

we have

\[ O' = \{(I+S)^{-1}\}'(I-S)' \quad \text{(Theorem 23)} \]
\[ \phantom{O'} = (I-S)^{-1}(I+S) \quad \text{(Theorem 27)} \]

and

\[ OO' = (I-S)(I+S)^{-1}(I-S)^{-1}(I+S). \tag{5} \]

Now $I-S$ and $I+S$ are commutative,† and, by hypothesis,
$I+S$ is non-singular. Hence‡

\[ (I-S)(I+S)^{-1} = (I+S)^{-1}(I-S), \]

and, from (5),

\[ OO' = (I+S)^{-1}(I-S)(I-S)^{-1}(I+S) = (I+S)^{-1}(I+S) = I. \]

Hence O is an orthogonal matrix (Theorem 52).

Note. When the elements of S are real, $I+S$ cannot be singular. This
fact, proved as a lemma in §2.2, was well known to Cayley, who
discovered Theorem 56.
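
Cayley's construction can be carried out in exact rational arithmetic. The sketch below is illustrative only: the particular skew-symmetric matrix `S`, the Gauss-Jordan routine `inverse`, and the other helper names are choices made here, and `inverse` assumes that the matrix handed to it is non-singular.

```python
from fractions import Fraction

def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(row) for row in zip(*m)]

def matmul(p, q):
    """Return the matrix product p q."""
    return [[sum(p[i][k] * q[k][j] for k in range(len(q)))
             for j in range(len(q[0]))] for i in range(len(p))]

def identity(n):
    return [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]

def inverse(m):
    """Reciprocal of a non-singular matrix by Gauss-Jordan elimination."""
    n = len(m)
    aug = [[Fraction(v) for v in row] + identity(n)[i] for i, row in enumerate(m)]
    for col in range(n):
        # Assumption: a non-zero pivot exists, i.e. the matrix is non-singular.
        pivot_row = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot_row] = aug[pivot_row], aug[col]
        pivot = aug[col][col]
        aug[col] = [v / pivot for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def cayley(s):
    """Return (I - S)(I + S)^{-1} for a square matrix S with I + S non-singular."""
    n = len(s)
    unit = identity(n)
    minus = [[unit[i][j] - s[i][j] for j in range(n)] for i in range(n)]
    plus = [[unit[i][j] + s[i][j] for j in range(n)] for i in range(n)]
    return matmul(minus, inverse(plus))

# An illustrative real skew-symmetric matrix of order 3.
S = [[0, 1, 2], [-1, 0, 3], [-2, -3, 0]]
O = cayley(S)
product = matmul(O, transpose(O))   # OO', which Theorem 56 asserts to be I
```

Because the arithmetic is exact, `product` can be compared directly with the unit matrix; no rounding tolerance is needed.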

2.2. Lemma. If S is a real skew-symmetric matrix, $I+S$ is
non-singular.

Consider the determinant $\Delta$ obtained by writing down S and
replacing the zeros of the principal diagonal by x; e.g., with
n = 3,

\[ \Delta = \begin{vmatrix} x & a & b \\ -a & x & c \\ -b & -c & x \end{vmatrix}, \qquad
   S = \begin{bmatrix} 0 & a & b \\ -a & 0 & c \\ -b & -c & 0 \end{bmatrix}. \]

† $(I-S)(I+S) = I^2 + IS - SI - S^2 = I^2 - S^2 = (I+S)(I-S)$ $[SI = IS]$.
‡ If $AQ = QA$ and Q is non-singular, then $AQ^{-1} = Q^{-1}A$.

For $Q^{-1}AQ = Q^{-1}QA = A$

and so $AQ^{-1} = (Q^{-1}AQ)Q^{-1} = Q^{-1}AQQ^{-1} = Q^{-1}A$.
The differential coefficient of $\Delta$ with respect to x is the sum of
n determinants (Chap. I, § 7), each of which is equal to a
skew-symmetric determinant of order $n-1$ when x = 0. Thus, when
n = 3,

\[ \frac{d\Delta}{dx} =
   \begin{vmatrix} 1 & 0 & 0 \\ -a & x & c \\ -b & -c & x \end{vmatrix} +
   \begin{vmatrix} x & a & b \\ 0 & 1 & 0 \\ -b & -c & x \end{vmatrix} +
   \begin{vmatrix} x & a & b \\ -a & x & c \\ 0 & 0 & 1 \end{vmatrix} \]
\[ \phantom{\frac{d\Delta}{dx}} =
   \begin{vmatrix} x & c \\ -c & x \end{vmatrix} +
   \begin{vmatrix} x & b \\ -b & x \end{vmatrix} +
   \begin{vmatrix} x & a \\ -a & x \end{vmatrix}, \]

and the latter become skew-symmetric determinants of order 2
when x = 0.

Differentiating again, in the general case, we see that $d^2\Delta/dx^2$
is the sum of $n(n-1)$ determinants each of which is equal to
a skew-symmetric determinant of order $n-2$ when x = 0; and
so on.

By Maclaurin's theorem, we then have

\[ \Delta = \Delta_0 + x\,\Sigma_1 + x^2\,\Sigma_2 + \dots + x^n, \tag{6} \]

where $\Delta_0$ is a skew-symmetric determinant of order n, $\Sigma_1$ a sum
of skew-symmetric determinants of order $n-1$, and so on.

But a skew-symmetric determinant of odd order is equal to
zero and one of even order is a perfect square (Theorems 19, 21).
Hence

\[ n \text{ even:} \quad \Delta = P_0 + x^2 P_2 + \dots + x^n, \]
\[ n \text{ odd:} \quad \Delta = x P_1 + x^3 P_3 + \dots + x^n, \]

where $P_0, P_1, \dots$ are either squares or the sums of squares, and
so $P_0, P_1, \dots$ are, in general, positive and, though they may be
zero in special cases, they cannot be negative.

The lemma follows on putting x = 1.
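
The lemma may be illustrated numerically. For n = 3 the expansion above gives $\Delta = x^3 + (a^2 + b^2 + c^2)x$, so that $|I+S| = 1 + a^2 + b^2 + c^2$, which is at least 1. The sketch below evaluates $\Delta$ directly for skew-symmetric matrices of orders 3 and 4 with integer elements; the helper names and the particular elements are illustrative choices, and the determinant is found by simple expansion along the first row.

```python
def det(m):
    """Determinant by expansion along the first row (adequate for small orders)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def skew_matrix(n, upper):
    """Skew-symmetric matrix whose elements above the diagonal, row by row, are `upper`."""
    values = iter(upper)
    s = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            s[i][j] = next(values)
            s[j][i] = -s[i][j]
    return s

def delta(s, x):
    """The determinant obtained from S on replacing the diagonal zeros by x."""
    n = len(s)
    return det([[x if i == j else s[i][j] for j in range(n)] for i in range(n)])

S3 = skew_matrix(3, [1, 2, 3])            # a = 1, b = 2, c = 3
S4 = skew_matrix(4, [1, 2, 3, 4, 5, 6])

odd_case_at_zero = delta(S3, 0)   # a skew-symmetric determinant of odd order
value_order_3 = delta(S3, 1)      # |I + S| for the matrix of order 3
value_order_4 = delta(S4, 1)      # |I + S| for the matrix of order 4
```

With these elements `odd_case_at_zero` vanishes, while `value_order_3` and `value_order_4` are both positive, as the lemma requires.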


2.3. Theorem 57. Every real orthogonal matrix A can be
expressed in the form

\[ J(I-S)(I+S)^{-1}, \]

where S is a skew-symmetric matrix and J is a matrix having ±1
in each diagonal place and zero elsewhere.

As a preliminary to the proof we establish a lemma that is
true for all square matrices, orthogonal or not.
Lemma. Given a square matrix A it is possible to choose a
matrix J, having ±1 in each diagonal place and zero elsewhere,
so that -1 is not a latent root of the matrix JA.

Multiplication by J merely changes the signs of the elements
of a matrix row by row; e.g.

\[ \begin{bmatrix} -1 & . & . \\ . & 1 & . \\ . & . & -1 \end{bmatrix}
   \begin{bmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{bmatrix}
   = \begin{bmatrix} -a_{11} & -a_{12} & -a_{13} \\ a_{21} & a_{22} & a_{23} \\ -a_{31} & -a_{32} & -a_{33} \end{bmatrix}. \]

Bearing this in mind, we see that, either the lemma is true or,
for every possible combination of row signs, we must have

\[ \begin{vmatrix}
   \pm a_{11} + 1 & \pm a_{12} & \dots & \pm a_{1n} \\
   \pm a_{21} & \pm a_{22} + 1 & \dots & \pm a_{2n} \\
   \vdots & & & \vdots \\
   \pm a_{n1} & \pm a_{n2} & \dots & \pm a_{nn} + 1
   \end{vmatrix} = 0, \tag{8} \]

the sign being the same throughout any one row.

But we can show that (8) is impossible. Suppose it is true.
Then, adding the two forms of (8), (i) with plus in the first row,
(ii) with minus in the first row, we obtain

\[ \begin{vmatrix}
   \pm a_{22} + 1 & \dots & \pm a_{2n} \\
   \vdots & & \vdots \\
   \pm a_{n2} & \dots & \pm a_{nn} + 1
   \end{vmatrix} = 0 \]

for every possible combination of row signs. We can proceed
by a like argument, reducing the order of the determinant by
unity at each step, until we arrive at $\pm a_{nn} + 1 = 0$; but it is
impossible to have both $+a_{nn} + 1 = 0$ and $-a_{nn} + 1 = 0$. Hence
(8) cannot be true.

2.4. Proof of Theorem 57. Let A be a given orthogonal
matrix with real elements. If A has a latent root -1, let
$J_1 A = A_1$ be a matrix whose latent roots are all different from
-1, where $J_1$ is of the same type as the J of the lemma.

Now $J_1$ is non-singular and its reciprocal $J_1^{-1}$ is also a matrix
having ±1 in the diagonal places and zero elsewhere, so that

\[ A = J_1^{-1} J_1 A = J_1^{-1} A_1 = J A_1, \]

where $J = J_1^{-1}$ and is of the type required by Theorem 57.

Moreover, $A_1$, being derived from A by a change of signs of
certain rows, is orthogonal and we have chosen it so that the
matrix $A_1 + I$ is non-singular. Hence it remains to prove that
when $A_1$ is a real orthogonal matrix such that $A_1 + I$ is
non-singular, we can choose a skew-symmetric matrix S so that

\[ A_1 = (I-S)(I+S)^{-1}. \]

To do this, let

\[ S = (I - A_1)(I + A_1)^{-1}. \tag{10} \]

The matrices on the right of (10) are commutative† and we
may write, without ambiguity,

\[ S = \frac{I - A_1}{I + A_1}, \qquad S' = \frac{I - A_1'}{I + A_1'}. \tag{11} \]

Further, $A_1 A_1' = A_1' A_1 = I$, since $A_1$ is orthogonal. Thus $A_1$
and $A_1'$ are commutative and we may work with $A_1$, $A_1'$, and I
as though they were ordinary numbers and so obtain, on using
the relation $A_1 A_1' = I$,

\[ S + S' = \frac{I - A_1}{I + A_1} + \frac{I - A_1'}{I + A_1'}
   = \frac{2I - 2A_1 A_1'}{I + A_1 + A_1' + A_1 A_1'} = 0. \]

Hence, when S is defined by (10), we have $S = -S'$; that is,
S is a skew-symmetric matrix. Moreover, from (10),

\[ S + SA_1 = I - A_1, \quad \text{that is,} \quad (I+S)A_1 = I - S, \]

and so, since $I+S$ is non-singular‡ and $I-S$, $I+S$ are
commutative, we have

\[ A_1 = (I-S)(I+S)^{-1}, \tag{12} \]

the order‖ of the two matrices on the right of (12) being
immaterial.

We have thus proved the theorem.

2.4. Combining Theorems 56 and 57 in so far as they relate
to matrices with real elements we see that

'If S is a real skew-symmetric matrix, then

\[ A = J(I-S)(I+S)^{-1} \]

is a real orthogonal matrix, and every real orthogonal matrix can
be so written.'

† Compare the footnote on p. 163.
‖ Compare the footnote on p. 163.
‡ Compare § 2.2.

2.5. Theorem 58. The latent roots of a real orthogonal matrix
are of unit modulus.

Let $[a_{rs}]$ be a real orthogonal matrix. Then (Chap. X, §7),
if $\lambda$ is a latent root of the matrix, the equations

\[ \lambda x_r = a_{r1}x_1 + \dots + a_{rn}x_n \qquad (r = 1, \dots, n) \tag{13} \]

have a solution $x_1, \dots, x_n$ other than $x_1 = \dots = x_n = 0$.

These x are not necessarily real, but since $a_{rs}$ is real we also
have, on taking the conjugate complex of (13),

\[ \bar{\lambda}\bar{x}_r = a_{r1}\bar{x}_1 + \dots + a_{rn}\bar{x}_n \qquad (r = 1, \dots, n). \]

By using the orthogonal relations (1) of § 1, we have

\[ \lambda\bar{\lambda} \sum_{r=1}^{n} x_r\bar{x}_r = \sum_{r=1}^{n} x_r\bar{x}_r. \]

But not all of $x_1, \dots, x_n$ are zero, and therefore $\sum x_r\bar{x}_r > 0$.

Hence $\lambda\bar{\lambda} = 1$, which proves the theorem.
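
For a matrix of order 2 the latent roots are the roots of $\lambda^2 - (a_{11} + a_{22})\lambda + |A| = 0$, and Theorem 58 can be checked directly. The following sketch does so for a rotation and for a reflection; the two matrices and the function name are illustrative choices, and the arithmetic is in floating point, so the moduli are equal to unity only up to rounding.

```python
import cmath

def latent_roots_2x2(m):
    """Roots of the characteristic equation of a matrix of order 2."""
    trace = m[0][0] + m[1][1]
    determinant = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    root = cmath.sqrt(trace * trace - 4 * determinant)
    return ((trace + root) / 2, (trace - root) / 2)

rotation = [[0.6, -0.8], [0.8, 0.6]]     # modulus +1; complex latent roots
reflection = [[0.6, 0.8], [0.8, -0.6]]   # modulus -1; real latent roots

rotation_roots = latent_roots_2x2(rotation)
reflection_roots = latent_roots_2x2(reflection)
moduli = [abs(z) for z in rotation_roots + reflection_roots]
```

The roots of the rotation are conjugate complex numbers, each the reciprocal of the other, which also illustrates Theorem 55.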


Examples XIII 

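1. Prove that the matrix

\[ \begin{bmatrix}
   -\tfrac12 & \tfrac12 & \tfrac12 & \tfrac12 \\
   \tfrac12 & -\tfrac12 & \tfrac12 & \tfrac12 \\
   \tfrac12 & \tfrac12 & -\tfrac12 & \tfrac12 \\
   \tfrac12 & \tfrac12 & \tfrac12 & -\tfrac12
   \end{bmatrix} \]

is orthogonal. Find its latent roots and verify that Theorems 54, 55, and
58 hold for this matrix.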

2. Prove that the matrix 


r4 

f 

4 1 

t 

4 

-4 

U -1 

4 J 


is orthogonal. 

3. Prove that when

\[ S = \begin{bmatrix} 0 & -c & -b \\ c & 0 & -a \\ b & a & 0 \end{bmatrix}, \]

$(I-S)(I+S)^{-1}$ is

\[ \frac{1}{1 + a^2 + b^2 + c^2}
   \begin{bmatrix}
   1 + a^2 - b^2 - c^2 & 2(c - ab) & 2(ac + b) \\
   -2(c + ab) & 1 - a^2 + b^2 - c^2 & 2(a - bc) \\
   2(ac - b) & -2(a + bc) & 1 - a^2 - b^2 + c^2
   \end{bmatrix}. \]

Hence find the general orthogonal transformation of three variables.†

4. A given symmetric matrix is denoted by A; $X'AX$ is the quadratic
form associated with A; S is a skew-symmetric matrix such that $I + SA$
is non-singular. Prove that, when $SA = AS$ and

\[ X = \frac{I - SA}{I + SA}\, Y, \]

\[ X'AX = Y'AY. \]

5. (Harder.)‡ Prove that, when A is symmetric, S skew-symmetric,
and $R = (A+S)^{-1}(A-S)$,

\[ R'(A+S)R = A+S, \qquad R'(A-S)R = A-S; \]
\[ R'AR = A, \qquad R'SR = S. \]

Prove also that, when $X = RY$,

\[ X'AX = Y'AY. \]


6. Prove Theorem 55 by the following method. Let A be an
orthogonal matrix. Multiply the determinant $|A - \lambda I|$ by $|A'|$ and, in the
resulting determinant, put $\lambda' = 1/\lambda$.

7. A transformation $x = UX$, $\bar{x} = \bar{U}\bar{X}$ which makes

\[ \sum x_r\bar{x}_r = \sum X_r\bar{X}_r \]

is called a unitary transformation. Prove that the mark of a unitary
transformation is the matrix equation $U\bar{U}' = I$.

8. Prove that the product of two unitary transformations is itself
unitary.

9. Prove that the modulus M of a unitary transformation satisfies the
equation $M\bar{M} = 1$.

10. Prove that each latent root of a unitary transformation is of the
form $e^{i\alpha}$, where $\alpha$ is real.


† Compare Lamb, Higher Mechanics, chapter i, examples 20 and 21, where
the transformation is obtained from kinematical considerations.

‡ These results are proved in Turnbull, Theory of Determinants, Matrices,
and Invariants.



CHAPTER XIV 

INVARIANTS AND COVARIANTS 


1. Introduction

1.1. A detailed study of invariants and covariants is not
possible in a single chapter of a small book. Such a study
requires a complete book, and books devoted exclusively to
that study already exist. All we attempt here is to introduce
the ideas and to develop them sufficiently for the reader to be
able to employ them, be it in algebra or in analytical geometry.

1.2. We have already encountered certain invariants. In 
Theorem 39 we proved that when the variables of a quadratic 
form are changed by a linear transformation, the discriminant 
of the form is multiplied by the square of the modulus of the 
transformation. Multiplication by a power of the modulus, not 
necessarily the square, as a result of a linear transformation is 
the mark of what is called an ‘invariant’. Strictly speaking, the 
word should mean something that does not change at all ; it is, 
in fact, applied to anything whose only change after a linear 
transformation of the variables is multiplication by a power of 
the modulus of the transformation. Anything that does not 
change at all after a linear transformation of the variables is 
called an ‘absolute invariant’. 


1.3. Definition of an algebraic form. Before we can give
a precise definition of 'invariant' we must explain certain
technical terms that arise.

Definition 10. A sum of terms, each of degree k in the n
variables x, y, ..., t,

\[ \sum \frac{k!}{\alpha!\,\beta!\dots\lambda!}\, a_{\alpha\beta\dots\lambda}\, x^{\alpha} y^{\beta} \dots t^{\lambda}, \tag{1} \]

wherein the a's are arbitrary constants and the sum is taken over
all integer or zero sets of values of $\alpha, \beta, \dots, \lambda$ which satisfy the
conditions

\[ 0 \leq \alpha \leq k, \ \dots, \ 0 \leq \lambda \leq k, \qquad \alpha + \beta + \dots + \lambda = k, \tag{2} \]

is called an algebraic form of degree k in the variables x, y, ..., t.

For instance, $ax^2 + 2hxy + by^2$ is an algebraic form of degree
2 in x and y, and

\[ \sum_{r=0}^{n} \frac{n!}{r!\,(n-r)!}\, a_r\, x^{n-r} y^{r} \tag{3} \]

is an algebraic form of degree n in x and y.

The multinomial coefficients $k!/\alpha!\dots\lambda!$ in (1) and the binomial
coefficients $n!/r!\,(n-r)!$ in (3) are not essential, but they lead
to considerable simplifications in the resulting theory of the
forms.


1.4. Notations. We shall use $F(a,x)$, and similar notations,
to denote an algebraic form; in the notation
$F(a,x)$ the single a symbolizes the various constants in (1) and
(3) and the single x symbolizes the variables. If we wish to
mark the degree k and the number of variables n, we shall use
$F(a,x)^k_n$.

An alternative notation is

\[ (a_0, a_1, \dots, a_n)(x, y)^n, \tag{4} \]

which is used to denote the form (3), and

\[ (a_0, \dots)(x, \dots, t)^k, \quad \text{or} \quad (a_0, \dots)(x_1, \dots, x_n)^k, \]

which is used to denote the form (1). In this notation the index
marks the degree of the form, while the number of variables is
either shown explicitly, as in (4), or is inferred from the context.

Clarendon type, such as x or X, will be used to denote
single-column matrices with n rows, the elements in the rows of x or
X being the variables of whatever algebraic forms are under
discussion.

The standard linear transformation from variables $x_1, \dots, x_n$
to variables $X_1, \dots, X_n$, namely,

\[ x_r = l_{r1}X_1 + \dots + l_{rn}X_n \qquad (r = 1, \dots, n), \tag{5} \]

will be denoted by

\[ x = MX, \tag{6} \]

M denoting the matrix of the coefficients $l_{rs}$ in (5). As in
previous chapters, $|M|$ will denote the determinant whose elements
are the elements of the matrix M.



On making the substitutions (5) in a form

\[ F(a,x) = (a_0, \dots)(x_1, \dots, x_n)^k \tag{7} \]

we obtain a form of degree k in $X_1, \dots, X_n$: this form we denote by

\[ G(A,X) = (A_0, \dots)(X_1, \dots, X_n)^k. \tag{8} \]

The constants A in (8) depend on the constants a in (7) and
on the coefficients $l_{rs}$. By its mode of derivation,

\[ G(A,X) \equiv F(a,x) \]

for all values of the variables x. For example, let

\[ F(a,x) = a_{11}x_1^2 + 2a_{12}x_1x_2 + a_{22}x_2^2, \]
\[ x_1 = l_{11}X_1 + l_{12}X_2, \qquad x_2 = l_{21}X_1 + l_{22}X_2; \]

then

\[ F(a,x) = G(A,X) = A_{11}X_1^2 + 2A_{12}X_1X_2 + A_{22}X_2^2, \]

where

\[ A_{11} = a_{11}l_{11}^2 + 2a_{12}l_{11}l_{21} + a_{22}l_{21}^2, \]
\[ A_{12} = a_{11}l_{11}l_{12} + a_{12}(l_{11}l_{22} + l_{21}l_{12}) + a_{22}l_{21}l_{22}, \]
\[ A_{22} = a_{11}l_{12}^2 + 2a_{12}l_{12}l_{22} + a_{22}l_{22}^2. \]

In the sequel, the last thing we shall wish to do will be to
calculate the actual expressions for the A's in terms of the a's:
it will be sufficient for us to reflect that they could be calculated
if necessary and to remember, at times, that the A's are linear
in the a's.

1.5. Definition of an invariant. 

Definition 11. A function of the coefficients a of the algebraic 
form F(a,x) is said to be an invariant of the form if , whatever the 
matrix M of (6) may be, the same function of the coefficients A 
of the form G(A,X) is equal to the original function (of the coeffi- 
cients a) multiplied by a power of the determinant \M\, the power 
of \M | in question being independent of M. 

For instance, in the example of § 1.4,

\[ A_{11}A_{22} - A_{12}^2 = |M|^2 (a_{11}a_{22} - a_{12}^2), \]

a result that may be proved either by laborious calculation
or by an appeal to Theorem 39. Hence, in accordance with
Definition 11, $a_{11}a_{22} - a_{12}^2$ is an invariant of the algebraic form
$a_{11}x_1^2 + 2a_{12}x_1x_2 + a_{22}x_2^2$.
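
The 'laborious calculation' is quickly done for particular numbers. The sketch below computes $A_{11}, A_{12}, A_{22}$ from the expressions of § 1.4 and compares the two sides of the relation above; the coefficients, the matrix M, and the function names are illustrative choices only.

```python
def transform_coefficients(a11, a12, a22, m):
    """Coefficients A_11, A_12, A_22 of the transformed binary quadratic form.

    m = [[l11, l12], [l21, l22]] is the matrix M of the substitution x = MX.
    """
    (l11, l12), (l21, l22) = m
    big_a11 = a11 * l11 ** 2 + 2 * a12 * l11 * l21 + a22 * l21 ** 2
    big_a12 = a11 * l11 * l12 + a12 * (l11 * l22 + l21 * l12) + a22 * l21 * l22
    big_a22 = a11 * l12 ** 2 + 2 * a12 * l12 * l22 + a22 * l22 ** 2
    return big_a11, big_a12, big_a22

def discriminant(a11, a12, a22):
    return a11 * a22 - a12 ** 2

a = (2, 3, -5)                 # a_11, a_12, a_22 (illustrative values)
M = [[2, 1], [1, 3]]           # |M| = 5
modulus = M[0][0] * M[1][1] - M[0][1] * M[1][0]

A = transform_coefficients(*a, M)
left_side = discriminant(*A)
right_side = modulus ** 2 * discriminant(*a)
```

With integer data the comparison of `left_side` with `right_side` is exact, and the power of $|M|$ that appears is the square, whatever M is chosen.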
Again, we may consider not merely one form $F(a,x)$ and its
transform $G(A,X)$, but several forms $F_1(a,x), \dots, F_r(a,x)$ and
their transforms $G_1(A,X), \dots, G_r(A,X)$: as before, we write
$G_r(A,X)$ for the result of substituting for x in terms of X in
$F_r(a,x)$, the substitution being given by $x = MX$.

Definition 12. A function of the coefficients a of a number of
algebraic forms $F_r(a,x)$ is said to be an invariant (sometimes a
joint-invariant) of the forms if, whatever the matrix M of the
transformation $x = MX$, the same function of the coefficients A of
the resulting forms $G_r(A,X)$ is equal to the original function (of the
coefficients a) multiplied by a certain power of the determinant $|M|$.

For example, if

\[ F_1(a,x) = a_1x + b_1y, \qquad F_2(a,x) = a_2x + b_2y, \tag{9} \]

and the transformation $x = MX$ is, in full,

\[ x = \alpha_1X + \beta_1Y, \qquad y = \alpha_2X + \beta_2Y, \tag{10} \]

so that

\[ F_1(a,x) = G_1(A,X) = (a_1\alpha_1 + b_1\alpha_2)X + (a_1\beta_1 + b_1\beta_2)Y, \]
\[ F_2(a,x) = G_2(A,X) = (a_2\alpha_1 + b_2\alpha_2)X + (a_2\beta_1 + b_2\beta_2)Y, \]

we see that

\[ \begin{vmatrix} A_1 & B_1 \\ A_2 & B_2 \end{vmatrix}
   = \begin{vmatrix} a_1\alpha_1 + b_1\alpha_2 & a_1\beta_1 + b_1\beta_2 \\ a_2\alpha_1 + b_2\alpha_2 & a_2\beta_1 + b_2\beta_2 \end{vmatrix}
   = \begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix} \times \begin{vmatrix} \alpha_1 & \beta_1 \\ \alpha_2 & \beta_2 \end{vmatrix}. \tag{11} \]

Hence, in accordance with Definition 12, $a_1b_2 - a_2b_1$ is a
joint-invariant of the two forms $a_1x + b_1y$, $a_2x + b_2y$.

This example is but a simple case of the rule for forming the
'product' of two transformations: if we think of (9) as the
transformation from variables $F_1$ and $F_2$ to variables x and y, and
(10) as the transformation from x and y to X and Y, then (11)
is merely a statement of the result proved in § 3.3 of Chapter X.

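1.6. Covariants. An invariant is a function of the
coefficients only. Certain functions which depend both on the
coefficients and on the variables of a form $F(a,x)$, or of a
number of forms $F_r(a,x)$, share with invariants the property of being
unaltered, save for multiplication by a power of $|M|$, when the
variables are changed by a substitution $x = MX$. Such
functions are called covariants.

For example, let u and v be forms in the variables x and y.
In these, make the substitutions

\[ x = l_1X + m_1Y, \qquad y = l_2X + m_2Y, \]

so that u, v are expressed in terms of X and Y. Then we have,
by the rules of differential calculus,

\[ \frac{\partial u}{\partial X} = l_1\frac{\partial u}{\partial x} + l_2\frac{\partial u}{\partial y}, \qquad
   \frac{\partial v}{\partial X} = l_1\frac{\partial v}{\partial x} + l_2\frac{\partial v}{\partial y}, \]
\[ \frac{\partial u}{\partial Y} = m_1\frac{\partial u}{\partial x} + m_2\frac{\partial u}{\partial y}, \qquad
   \frac{\partial v}{\partial Y} = m_1\frac{\partial v}{\partial x} + m_2\frac{\partial v}{\partial y}. \]

The rules for multiplying determinants show at once that

\[ \begin{vmatrix} \dfrac{\partial u}{\partial X} & \dfrac{\partial u}{\partial Y} \\[2mm] \dfrac{\partial v}{\partial X} & \dfrac{\partial v}{\partial Y} \end{vmatrix}
   = \begin{vmatrix} l_1 & l_2 \\ m_1 & m_2 \end{vmatrix} \times
     \begin{vmatrix} \dfrac{\partial u}{\partial x} & \dfrac{\partial u}{\partial y} \\[2mm] \dfrac{\partial v}{\partial x} & \dfrac{\partial v}{\partial y} \end{vmatrix}. \tag{12} \]

The import of (12) is best seen if we write it in full. Let

\[ u = a_0x^n + na_1x^{n-1}y + \dots + a_ny^n, \qquad
   v = b_0x^n + nb_1x^{n-1}y + \dots + b_ny^n, \tag{13} \]

and, when expressed in terms of X, Y,

\[ u = A_0X^n + nA_1X^{n-1}Y + \dots + A_nY^n, \qquad
   v = B_0X^n + nB_1X^{n-1}Y + \dots + B_nY^n. \]

Then (12) asserts that

\[ \begin{vmatrix} A_0X^{n-1} + \dots + A_{n-1}Y^{n-1} & A_1X^{n-1} + \dots + A_nY^{n-1} \\
                   B_0X^{n-1} + \dots + B_{n-1}Y^{n-1} & B_1X^{n-1} + \dots + B_nY^{n-1} \end{vmatrix} \]
\[ \qquad = \begin{vmatrix} l_1 & l_2 \\ m_1 & m_2 \end{vmatrix} \times
   \begin{vmatrix} a_0x^{n-1} + \dots + a_{n-1}y^{n-1} & a_1x^{n-1} + \dots + a_ny^{n-1} \\
                   b_0x^{n-1} + \dots + b_{n-1}y^{n-1} & b_1x^{n-1} + \dots + b_ny^{n-1} \end{vmatrix}. \]

The function

\[ \begin{vmatrix} a_0x^{n-1} + \dots + a_{n-1}y^{n-1} & a_1x^{n-1} + \dots + a_ny^{n-1} \\
                   b_0x^{n-1} + \dots + b_{n-1}y^{n-1} & b_1x^{n-1} + \dots + b_ny^{n-1} \end{vmatrix}, \tag{14} \]

depending on the constants $a_r$, $b_r$ of the forms u, v and on the
variables x, y, is, apart from the factor $|M|$, unaltered
when the constants $a_r$, $b_r$ are replaced by the constants $A_r$, $B_r$
and the variables x, y replaced by the variables X, Y.
Accordingly, we say that (14) is a covariant of the forms u, v.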

1.7. Note on the definitions. When an invariant is defined as

'a function of the coefficients a, of a form $F(a,x)$, which is equal to
the same function of the coefficients A, of the transform $G(A,X)$,
multiplied by a factor that depends only on the constants of the
transformation',

it can be proved that the factor in question must be a power of the
modulus of the transformation.

In the present, elementary, treatment of the subject we have left
aside the possibility of the factor being other than a power of $|M|$. It
is, however, a point of some interest to note that the wider definition
can be adopted with the same ultimate restriction on the nature of the
factor, namely, to be a power of the modulus.


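2. Examples of invariants and covariants

2.1. Jacobians. Let u, v, ..., w be n forms in the n variables
x, y, ..., z. The determinant

\[ \begin{vmatrix} u_x & v_x & \dots & w_x \\ u_y & v_y & \dots & w_y \\ \vdots & & & \vdots \\ u_z & v_z & \dots & w_z \end{vmatrix}, \]

where $u_x, u_y, \dots$ denote $\partial u/\partial x, \partial u/\partial y, \dots$, is called the Jacobian of
the n forms. It is usually written as

\[ \frac{\partial(u, v, \dots, w)}{\partial(x, y, \dots, z)}. \]

It is, as an extension of the argument of § 1.6 will show, a
covariant of the forms u, v, ..., w, and if $x = MX$,

\[ \frac{\partial(u, v, \dots, w)}{\partial(X, Y, \dots, Z)} = |M|\, \frac{\partial(u, v, \dots, w)}{\partial(x, y, \dots, z)}. \tag{1} \]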

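2.2. Hessians. Let $u$ be a form in the variables $x, y, \dots, z$
and let $u_{xx}, u_{xy}, \dots$ denote $\partial^2u/\partial x^2, \partial^2u/\partial x\,\partial y, \dots$. Then the
determinant

\[
\begin{vmatrix}
u_{xx} & u_{xy} & \cdots & u_{xz} \\
u_{yx} & u_{yy} & \cdots & u_{yz} \\
\vdots & \vdots & & \vdots \\
u_{zx} & u_{zy} & \cdots & u_{zz}
\end{vmatrix}
\tag{2}
\]

is called the Hessian of $u$ and is a covariant of $u$; in fact, if we
symbolize (2) as $H(u; x)$, we may prove that

\[ H(u; X) = |M|^2 H(u; x). \tag{3} \]

For, by considering the Jacobian of the forms $u_x, u_y, \dots$, and
calling them $u_1, u_2, \dots$, we find that §2.1 gives

\[
\frac{\partial(u_1, u_2, \dots)}{\partial(X, Y, \dots)}
= |M|\,\frac{\partial(u_1, u_2, \dots)}{\partial(x, y, \dots)}.
\tag{4}
\]

But

\[ \frac{\partial u_1}{\partial X} = \frac{\partial}{\partial X}\,\frac{\partial u}{\partial x}, \]

and if

\[
\begin{aligned}
x &= l_1X + l_2Y + \dots,\\
y &= m_1X + m_2Y + \dots,
\end{aligned}
\]

then

\[
\begin{aligned}
\frac{\partial}{\partial X} &= l_1\frac{\partial}{\partial x} + m_1\frac{\partial}{\partial y} + \dots,\\
\frac{\partial}{\partial Y} &= l_2\frac{\partial}{\partial x} + m_2\frac{\partial}{\partial y} + \dots.
\end{aligned}
\tag{5}
\]

Hence

\[
\begin{vmatrix}
u_{XX} & u_{XY} & \cdots & u_{XZ} \\
u_{YX} & u_{YY} & \cdots & u_{YZ} \\
\vdots & \vdots & & \vdots \\
u_{ZX} & u_{ZY} & \cdots & u_{ZZ}
\end{vmatrix}
=
\begin{vmatrix}
l_1 & m_1 & \cdots \\
l_2 & m_2 & \cdots \\
\vdots & \vdots & \\
l_n & m_n & \cdots
\end{vmatrix}
\times
\begin{vmatrix}
u_{xX} & u_{yX} & \cdots \\
u_{xY} & u_{yY} & \cdots \\
\vdots & \vdots & \\
u_{xZ} & u_{yZ} & \cdots
\end{vmatrix},
\]

as may be seen by multiplying the last two determinants by
rows and applying (5). That is to say,

\[ H(u; X) = |M|\,\frac{\partial(u_1, u_2, \dots)}{\partial(X, Y, \dots)}, \]

which, on using (4), yields

\[ H(u; X) = |M|^2 H(u; x). \]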
2.3. Eliminant of linear forms. When the forms $u, v, \dots, w$
of §2.1 are linear in the variables we obtain the result that the
determinant $|a_{rs}|$ of the linear forms

\[ a_{r1}x_1 + \dots + a_{rn}x_n \qquad (r = 1, \dots, n) \]

is an invariant. It is sometimes called the eliminant of the
forms. As we have seen in Theorem 11, the vanishing of
the eliminant is a necessary and sufficient condition for the
equations

\[ a_{r1}x_1 + \dots + a_{rn}x_n = 0 \qquad (r = 1, \dots, n) \]

to have a solution other than $x_1 = \dots = x_n = 0$.

2.4. Discriminants of quadratic forms. When the form
$u$ of §2.2 is a quadratic form, its Hessian is independent of the
variables and so is an invariant. When

\[ u = \sum_{r=1}^{n}\sum_{s=1}^{n} a_{rs}x_rx_s \qquad (a_{rs} = a_{sr}), \tag{6} \]

we have $\partial^2u/\partial x_r\,\partial x_s = 2a_{rs}$,
so that the Hessian of (6) is, apart from a power of 2, the
determinant $|a_{rs}|$, namely, the discriminant of the quadratic
form.

This provides an alternative proof of Theorem 39.

2.5. Invariants and covariants of a binary cubic. A form
in two variables is usually called a binary form: thus the general
binary cubic is

\[ a_0x^3 + 3a_1x^2y + 3a_2xy^2 + a_3y^3. \tag{7} \]

We can write down one covariant and deduce from it a second
covariant and one invariant. The Hessian of (7) is (§2.2) a
covariant; it is, apart from numerical factors,

\[
\begin{vmatrix}
a_0x + a_1y & a_1x + a_2y \\
a_1x + a_2y & a_2x + a_3y
\end{vmatrix},
\]

i.e.

\[ (a_0a_2 - a_1^2)x^2 + (a_0a_3 - a_1a_2)xy + (a_1a_3 - a_2^2)y^2. \tag{8} \]

The discriminant of (8) is an invariant of (8): we may expect
to find (we prove a general theorem later, §3.6) that it is an
invariant of (7) also.† The discriminant is

\[ (a_0a_2 - a_1^2)(a_1a_3 - a_2^2) - \tfrac14(a_0a_3 - a_1a_2)^2. \tag{9} \]

† The reader will clarify his ideas if he writes down the precise meaning of
the phrases ‘is an invariant of (7)’, ‘is an invariant of (8)’.

Again, the Jacobian of (7) and (8), which we may expect‡
to be a covariant of (7) itself, is, apart from numerical factors,

\[
\begin{vmatrix}
a_0x^2 + 2a_1xy + a_2y^2 & a_1x^2 + 2a_2xy + a_3y^2 \\
2(a_0a_2 - a_1^2)x + (a_0a_3 - a_1a_2)y & (a_0a_3 - a_1a_2)x + 2(a_1a_3 - a_2^2)y
\end{vmatrix},
\]

i.e.

\[
\begin{aligned}
&(a_0^2a_3 - 3a_0a_1a_2 + 2a_1^3)x^3 + 3(a_0a_1a_3 - 2a_0a_2^2 + a_1^2a_2)x^2y\\
&\quad + 3(2a_1^2a_3 - a_0a_2a_3 - a_1a_2^2)xy^2 + (3a_1a_2a_3 - a_0a_3^2 - 2a_2^3)y^3.
\end{aligned}
\tag{10}
\]

‡ Compare §§3.6, 3.7.

At a later stage we shall show that an algebraical relation
connects (7), (8), (9), and (10). Meanwhile we note an interesting
(and in the advanced theory, an important) fact concerning the
coefficients of the covariants (8) and (10).

If we write

\[
\begin{aligned}
&c_0x^2 + c_1xy + \frac{1}{2!}c_2y^2 &&\text{for a quadratic covariant,}\\
&c_0x^3 + c_1x^2y + \frac{1}{2!}c_2xy^2 + \frac{1}{3!}c_3y^3 &&\text{for a cubic covariant,}
\end{aligned}
\]

and so on for covariants of higher degree, the coefficients $c_r$ are
not a disordered set of numbers, as a first glance at (10) would
suggest, but are given in terms of $c_0$ by means of the formula

\[
c_{r+1} = \Bigl(p\,a_1\frac{\partial}{\partial a_0} + (p-1)a_2\frac{\partial}{\partial a_1} + \dots + a_p\frac{\partial}{\partial a_{p-1}}\Bigr)c_r,
\tag{11}
\]

where $p$ is the degree of the form $u$. Thus, when we start with
the cubic (7), for which $p = 3$, and take the covariant (8), so
that $c_0 = a_0a_2 - a_1^2$, we find that

\[
3a_1\frac{\partial c_0}{\partial a_0} + 2a_2\frac{\partial c_0}{\partial a_1} + a_3\frac{\partial c_0}{\partial a_2} = a_0a_3 - a_1a_2 = c_1
\]

and

\[
3a_1\frac{\partial c_1}{\partial a_0} + 2a_2\frac{\partial c_1}{\partial a_1} + a_3\frac{\partial c_1}{\partial a_2} = 2(a_1a_3 - a_2^2) = c_2,
\]

and the whole covariant (8) is thus derived, by differentiations,
from its leading term $(a_0a_2 - a_1^2)x^2$.

Exercise. Prove that when (10) is written as

\[ c_0x^3 + c_1x^2y + \tfrac12c_2xy^2 + \tfrac16c_3y^3, \]

\[
c_{r+1} = 3a_1\frac{\partial c_r}{\partial a_0} + 2a_2\frac{\partial c_r}{\partial a_1} + a_3\frac{\partial c_r}{\partial a_2}
\qquad (r = 0, 1, 2).
\]
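
The differentiation rule of §2.5 can be checked mechanically. The following is a minimal sketch, not part of the original text: it stores a polynomial in $a_0, \dots, a_3$ as a dictionary from exponent tuples to integer coefficients (a representation assumed here purely for illustration), applies the operator $3a_1\,\partial/\partial a_0 + 2a_2\,\partial/\partial a_1 + a_3\,\partial/\partial a_2$, and confirms the values of $c_1$, $c_2$ for the covariant (8) and of $c_1$ for the covariant (10). The helper names are hypothetical.

```python
from collections import defaultdict

# A polynomial in a0..a3 is a dict {(e0, e1, e2, e3): integer coefficient}.

def add(p, q):
    out = defaultdict(int)
    for poly in (p, q):
        for mono, c in poly.items():
            out[mono] += c
    return {m: c for m, c in out.items() if c != 0}

def mul(p, q):
    out = defaultdict(int)
    for m1, c1 in p.items():
        for m2, c2 in q.items():
            out[tuple(e1 + e2 for e1, e2 in zip(m1, m2))] += c1 * c2
    return {m: c for m, c in out.items() if c != 0}

def scale(p, k):
    return {m: k * c for m, c in p.items() if k * c != 0}

def var(i):
    mono = [0, 0, 0, 0]
    mono[i] = 1
    return {tuple(mono): 1}

def diff(p, i):
    """Partial derivative with respect to a_i."""
    out = {}
    for mono, c in p.items():
        if mono[i] > 0:
            new = list(mono)
            new[i] -= 1
            out[tuple(new)] = c * mono[i]
    return out

def step(poly, degree=3):
    """Apply degree*a1*d/da0 + (degree-1)*a2*d/da1 + ... + a_degree*d/da_(degree-1)."""
    total = {}
    for i in range(degree):
        total = add(total, scale(mul(var(i + 1), diff(poly, i)), degree - i))
    return total

a0, a1, a2, a3 = (var(i) for i in range(4))

# Leading coefficient of the Hessian covariant (8): a0*a2 - a1^2.
c0 = add(mul(a0, a2), scale(mul(a1, a1), -1))
c1 = step(c0)   # expected a0*a3 - a1*a2
c2 = step(c1)   # expected 2*(a1*a3 - a2^2)

# Leading coefficient of the cubic covariant (10): a0^2*a3 - 3*a0*a1*a2 + 2*a1^3.
g0 = add(add(mul(mul(a0, a0), a3), scale(mul(mul(a0, a1), a2), -3)),
         scale(mul(mul(a1, a1), a1), 2))
g1 = step(g0)   # expected 3*(a0*a1*a3 - 2*a0*a2^2 + a1^2*a2)

print(c1, c2, g1)
```

The dictionary form makes the comparison exact: two polynomials are equal precisely when their dictionaries are equal.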
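2.6. A method of forming joint-invariants. Let

\[ u = F(a, x) = (a_0, \dots, a_m)(x_1, \dots, x_n)^k, \tag{12} \]

and suppose we know that a certain polynomial function of the
$a$'s, say $\phi(a_0, \dots, a_m)$,
is an invariant of $F(a, x)$. That is to say, when we substitute
$x = MX$ in (12) we obtain a form

\[ u = G(A, X) = (A_0, \dots, A_m)(X_1, \dots, X_n)^k, \tag{13} \]

and we suppose that the $a$ of (12) and the $A$ of (13) satisfy
the relation

\[ \phi(A_0, \dots, A_m) = |M|^s\,\phi(a_0, \dots, a_m), \tag{14} \]

where $s$ is some fixed constant, independent of the matrix $M$.

Now (12) typifies any form of degree $k$ in the $n$ variables
$x_1, \dots, x_n$, and (14) may be regarded as a statement concerning
the coefficients of any such form. Thus, if

\[ v = F(a', x) = (a'_0, \dots, a'_m)(x_1, \dots, x_n)^k \]

is transformed by $x = MX$ into

\[ v = G(A', X) = (A'_0, \dots, A'_m)(X_1, \dots, X_n)^k, \]

the coefficients $a'$, $A'$ satisfy the relation

\[ \phi(A'_0, \dots, A'_m) = |M|^s\,\phi(a'_0, \dots, a'_m). \tag{15} \]

Equally, when $\lambda$ is an arbitrary constant, $u + \lambda v$ is a form of
degree $k$ in the variables $x_1, \dots, x_n$. It may be written as

\[ (a_0 + \lambda a'_0, \dots, a_m + \lambda a'_m)(x_1, \dots, x_n)^k \]

and, after the transformation $x = MX$, it becomes

\[ (A_0, \dots, A_m)(X_1, \dots, X_n)^k + \lambda(A'_0, \dots, A'_m)(X_1, \dots, X_n)^k, \]

that is,

\[ (A_0 + \lambda A'_0, \dots, A_m + \lambda A'_m)(X_1, \dots, X_n)^k. \]

Just as (15) may be regarded as being a mere change of
notation in (14), so, by considering the invariant $\phi$ of the form
$u + \lambda v$, we have

\[ \phi(A_0 + \lambda A'_0, \dots, A_m + \lambda A'_m) = |M|^s\,\phi(a_0 + \lambda a'_0, \dots, a_m + \lambda a'_m), \tag{16} \]

which is a mere change of notation in (14).

Now each side of (16) is a polynomial in $\lambda$ whose coefficients
are functions of the $A$, $a$, and $|M|$; moreover, (16) is true for
all values of $\lambda$ and so the coefficients of $\lambda^0, \lambda, \lambda^2, \dots$ on each side
of (16) are equal. Hence the coefficient of each power of $\lambda$ in
the expansion of

\[ \phi(a_0 + \lambda a'_0, \dots, a_m + \lambda a'_m) \]

is a joint-invariant of $u$ and $v$; for if this coefficient is

\[ \phi_r(a_0, \dots, a_m;\ a'_0, \dots, a'_m), \]

then (16) gives

\[ \phi_r(A_0, \dots, A_m;\ A'_0, \dots, A'_m) = |M|^s\,\phi_r(a_0, \dots, a_m;\ a'_0, \dots, a'_m). \tag{17} \]

The rules of the differential calculus enable us to formulate
the coefficients of the powers of $\lambda$ quite simply. Write

\[ b_0 = a_0 + \lambda a'_0, \quad \dots, \quad b_m = a_m + \lambda a'_m. \]

Then

\[
\begin{aligned}
\frac{d}{d\lambda}\,\phi(a_0 + \lambda a'_0, \dots, a_m + \lambda a'_m)
&= \frac{\partial\phi}{\partial b_0}\frac{db_0}{d\lambda} + \dots + \frac{\partial\phi}{\partial b_m}\frac{db_m}{d\lambda}\\
&= a'_0\frac{\partial\phi}{\partial b_0} + \dots + a'_m\frac{\partial\phi}{\partial b_m},
\end{aligned}
\tag{18}
\]

and the value of (18) when $\lambda = 0$ is given by

\[ \Bigl(a'_0\frac{\partial}{\partial a_0} + \dots + a'_m\frac{\partial}{\partial a_m}\Bigr)\phi(a_0, \dots, a_m). \tag{19} \]

Hence, by Maclaurin's theorem,

\[
\begin{aligned}
\phi(a_0 + \lambda a'_0, \dots, a_m + \lambda a'_m)
&= \phi(a_0, \dots, a_m) + \lambda\Bigl(a'_0\frac{\partial}{\partial a_0} + \dots + a'_m\frac{\partial}{\partial a_m}\Bigr)\phi(a_0, \dots, a_m)\\
&\quad + \frac{\lambda^2}{2!}\Bigl(a'_0\frac{\partial}{\partial a_0} + \dots + a'_m\frac{\partial}{\partial a_m}\Bigr)^2\phi(a_0, \dots, a_m) + \dots,
\end{aligned}
\]

the expansion terminating after a certain point since
$\phi(a_0 + \lambda a'_0, \dots, a_m + \lambda a'_m)$
is a polynomial in $\lambda$. Hence, remembering the result (17), we
have

\[
\Bigl(A'_0\frac{\partial}{\partial A_0} + \dots + A'_m\frac{\partial}{\partial A_m}\Bigr)^r\phi(A_0, \dots, A_m)
= |M|^s\Bigl(a'_0\frac{\partial}{\partial a_0} + \dots + a'_m\frac{\partial}{\partial a_m}\Bigr)^r\phi(a_0, \dots, a_m).
\tag{20}
\]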
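
As an illustration of the method of §2.6, take $\phi = ab - h^2$, the discriminant of the binary quadratic $ax^2 + 2hxy + by^2$. The coefficient of $\lambda$ in $\phi(a + \lambda a', h + \lambda h', b + \lambda b')$ is $ab' + a'b - 2hh'$, which should therefore be a joint-invariant with the same factor $|M|^2$. The sketch below checks this numerically for one arbitrarily chosen pair of forms and one transformation $x = l_1X + m_1Y$, $y = l_2X + m_2Y$; the function names and the numerical values are assumptions made for the illustration.

```python
def transform_quadratic(a, h, b, l1, m1, l2, m2):
    """Coefficients (A, H, B) of a x^2 + 2h xy + b y^2 after the
    substitution x = l1 X + m1 Y, y = l2 X + m2 Y."""
    A = a * l1 * l1 + 2 * h * l1 * l2 + b * l2 * l2
    H = a * l1 * m1 + h * (l1 * m2 + l2 * m1) + b * l2 * m2
    B = a * m1 * m1 + 2 * h * m1 * m2 + b * m2 * m2
    return A, H, B

def phi(a, h, b):
    """The invariant ab - h^2 of a single quadratic form."""
    return a * b - h * h

def joint(u, v):
    """Coefficient of lambda in phi(u + lambda v): ab' + a'b - 2hh'."""
    (a, h, b), (a2, h2, b2) = u, v
    return a * b2 + a2 * b - 2 * h * h2

u = (1, 2, 3)            # x^2 + 4xy + 3y^2
v = (4, 5, 6)            # 4x^2 + 10xy + 6y^2
l1, m1, l2, m2 = 2, 1, 1, 3
det = l1 * m2 - l2 * m1  # the modulus |M|

U = transform_quadratic(*u, l1, m1, l2, m2)
V = transform_quadratic(*v, l1, m1, l2, m2)

# The coefficient of lambda can also be read off from phi itself,
# since phi(u + lambda v) is a quadratic polynomial in lambda.
plus = phi(*(p + q for p, q in zip(u, v)))
minus = phi(*(p - q for p, q in zip(u, v)))

print(phi(*U), det ** 2 * phi(*u))      # invariant of u alone
print(joint(U, V), det ** 2 * joint(u, v))  # joint-invariant of u and v
```

With these values $|M| = 5$, and both printed pairs agree, in accordance with (14) and (17).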
Examples XIV a

Examples 1-4 can be worked by straightforward algebra and without
appeal to any general theorem. In these examples the transformation
is taken to be

\[ x = l_1X + m_1Y, \qquad y = l_2X + m_2Y, \]

and so $|M|$ is equal to $l_1m_2 - l_2m_1$; forms $ax + by$, $ax^2 + 2bxy + cy^2$ are
transformed into $AX + BY$, $AX^2 + 2BXY + CY^2$; and so for other
forms.

1. Prove that $ab' - a'b$ is an invariant (joint-invariant) of the forms
$ax + by$, $a'x + b'y$.

Ans. $AB' - A'B = |M|(ab' - a'b)$.

2. Prove that $ab'^2 - 2ba'b' + ca'^2$ is an invariant of the forms
$ax^2 + 2bxy + cy^2$, $a'x + b'y$.

Ans. $AB'^2 - 2BA'B' + CA'^2 = |M|^2(ab'^2 - 2ba'b' + ca'^2)$.

3. Prove that $ab' + a'b - 2hh'$ is an invariant of the two quadratic
forms $ax^2 + 2hxy + by^2$, $a'x^2 + 2h'xy + b'y^2$.

Ans. $AB' + A'B - 2HH' = |M|^2(ab' + a'b - 2hh')$.

4. Prove that $b'(ax + by) - a'(bx + cy)$ is a covariant of the two forms
$ax^2 + 2bxy + cy^2$, $a'x + b'y$.

Ans. $B'(AX + BY) - A'(BX + CY) = |M|\{b'(ax + by) - a'(bx + cy)\}$.

The remaining examples are not intended to be proved by sheer
substitution.

5. Prove the result of Example 4 by considering the Jacobian of the
two forms $ax^2 + 2bxy + cy^2$, $a'x + b'y$.

6. Prove that $(ab' - a'b)x^2 + (ac' - a'c)xy + (bc' - b'c)y^2$ is the Jacobian
of the forms $ax^2 + 2bxy + cy^2$, $a'x^2 + 2b'xy + c'y^2$.

7. Prove the result of Example 1 by considering the Jacobian of the
forms $ax + by$, $a'x + b'y$.

8. Prove that $\sum(b_1c_2 - b_2c_1)(ax + hy + gz)$ is the Jacobian (and so a
covariant) of the three forms

\[ ax^2 + by^2 + cz^2 + 2fyz + 2gzx + 2hxy, \quad a_1x + b_1y + c_1z, \quad a_2x + b_2y + c_2z. \]

Examples 9-12 are exercises on §2.6.

9. Prove the result of Example 3 by first showing that $ab - h^2$ is an
invariant of $ax^2 + 2hxy + by^2$.

10. Prove the result of Example 2 by considering the joint-invariant
of $ax^2 + 2hxy + by^2$ and $(a'x + b'y)^2$.

11. Prove that $abc + 2fgh - af^2 - bg^2 - ch^2$, i.e. the determinant

\[
\begin{vmatrix}
a & h & g \\
h & b & f \\
g & f & c
\end{vmatrix},
\]

is an invariant of the quadratic form $ax^2 + by^2 + cz^2 + 2fyz + 2gzx + 2hxy$,
and find two joint-invariants of this form and of the form $a'x^2 + \dots + 2h'xy$.

Note. These joint-invariants, usually denoted by $\Theta$, $\Theta'$,

\[
\begin{aligned}
\Theta &= a'A + b'B + c'C + 2f'F + 2g'G + 2h'H,\\
\Theta' &= aA' + bB' + cC' + 2fF' + 2gG' + 2hH',
\end{aligned}
\]

where $A, A', \dots$ denote co-factors of $a, a', \dots$ in the discriminants $\Delta$, $\Delta'$,
are of some importance in analytical geometry. Compare Sommerville,
Analytical Conics, chapter xx.

12. Find a joint-invariant of the quadratic form

\[ ax^2 + by^2 + cz^2 + 2fyz + 2gzx + 2hxy \]

and the linear form $lx + my + nz$. What is the geometrical significance
of this joint-invariant?

Prove that the second joint-invariant, indicated by $\Theta'$ of Example 11,
is identically zero.

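13. Prove that

\[
\begin{vmatrix}
l^2 & lm & m^2 \\
2ll' & lm' + l'm & 2mm' \\
l'^2 & l'm' & m'^2
\end{vmatrix}
= (lm' - l'm)^3.
\]

14. Prove that

\[
\begin{vmatrix}
\dfrac{\partial^4u}{\partial x^4} & \dfrac{\partial^4u}{\partial x^3\,\partial y} & \dfrac{\partial^4u}{\partial x^2\,\partial y^2} \\[1.5ex]
\dfrac{\partial^4u}{\partial x^3\,\partial y} & \dfrac{\partial^4u}{\partial x^2\,\partial y^2} & \dfrac{\partial^4u}{\partial x\,\partial y^3} \\[1.5ex]
\dfrac{\partial^4u}{\partial x^2\,\partial y^2} & \dfrac{\partial^4u}{\partial x\,\partial y^3} & \dfrac{\partial^4u}{\partial y^4}
\end{vmatrix}
\]

is a covariant of a form $u$ whose degree exceeds 4 and an invariant of a
form $u$ of degree 4.

Hint. Multiplication by the determinant of Example 13 gives a
determinant whose first row is [$x = lX + mY$, $y = l'X + m'Y$]

\[
\frac{\partial^4u}{\partial X^2\,\partial x^2}, \qquad
\frac{\partial^4u}{\partial X\,\partial Y\,\partial x^2}, \qquad
\frac{\partial^4u}{\partial Y^2\,\partial x^2}.
\]

Compare §2.2: here

\[ \Bigl(l\frac{\partial}{\partial x} + l'\frac{\partial}{\partial y}\Bigr)^2 = \frac{\partial^2}{\partial X^2}, \]

and so on.

A multiplication of the determinant just obtained by that of Example
13 will give a determinant whose first row is

\[
\frac{\partial^4u}{\partial X^4}, \qquad
\frac{\partial^4u}{\partial X^3\,\partial Y}, \qquad
\frac{\partial^4u}{\partial X^2\,\partial Y^2}.
\]

Hence $x = MX$ multiplies the initial determinant by $|M|^6$.

15. Prove that $ace + 2bcd - ad^2 - b^2e - c^3$, i.e.

\[
\begin{vmatrix}
a & b & c \\
b & c & d \\
c & d & e
\end{vmatrix},
\]

is an invariant of $(a, b, c, d, e)(x, y)^4$.

16. Prove that

\[
\begin{vmatrix}
u_{xx} & u_{xy} & u_{yy} \\
v_{xx} & v_{xy} & v_{yy} \\
w_{xx} & w_{xy} & w_{yy}
\end{vmatrix}
\]

is a covariant of the forms $u$, $v$, $w$.

Hint. The multiplying factor is $|M|^3$.
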

3. Properties of invariants

3.1. Invariants are homogeneous forms in the coefficients.
Let $I(a_0, \dots, a_m)$ be an invariant of

\[ u = (a_0, \dots, a_m)(x_1, \dots, x_n)^k. \]

Let the transformation $x = MX$ change $u$ into

\[ u = (A_0, \dots, A_m)(X_1, \dots, X_n)^k. \]

Then, whatever $M$ may be,

\[ I(A_0, \dots, A_m) = |M|^s\,I(a_0, \dots, a_m), \tag{1} \]

the index $s$ being independent of $M$.

Consider the transformation

\[ x_1 = \lambda X_1, \quad x_2 = \lambda X_2, \quad \dots, \quad x_n = \lambda X_n, \]

wherein $|M| = \lambda^n$.

Since $u$ is homogeneous and of degree $k$ in $x_1, \dots, x_n$, the values
of $A_0, \dots, A_m$ are, for this particular transformation,

\[ a_0\lambda^k, \quad \dots, \quad a_m\lambda^k. \]

Hence (1) becomes

\[ I(a_0\lambda^k, \dots, a_m\lambda^k) = \lambda^{ns}\,I(a_0, \dots, a_m). \tag{2} \]

That is to say, $I$ is a homogeneous† function of its arguments
$a_0, \dots, a_m$ and, if its degree is $q$, $\lambda^{kq} = \lambda^{ns}$, a result which proves,
further, that $s$ is given in terms of $k$, $q$, and $n$ by

\[ s = kq/n. \]

† The result is a well-known one. If the reader does not know the result, a
proof can be obtained by assuming that $I$ is the sum of $I_1, I_2, \dots$, each
homogeneous and of degrees $q_1, q_2, \dots$, and then showing that the assumption
contradicts (2).

3.2. Weight: binary forms. In the binary form

\[ a_0x^k + ka_1x^{k-1}y + \dots + a_ky^k \]

the suffix of each coefficient is called its weight and the
weight of a product of coefficients is defined to be the sum of
the weights of its constituent factors. Thus $a_0^{\alpha}a_1^{\beta}a_2^{\gamma}\dots$ is of weight
$\beta + 2\gamma + \dots$.

A polynomial form in the coefficients is said to be isobaric
when each term has the same weight. Thus $a_0a_2 - a_1^2$ is isobaric,
for $a_0a_2$ is of weight $0 + 2$, and $a_1^2$ is of weight $2 \times 1$; we refer
to the whole expression $a_0a_2 - a_1^2$ as being of weight 2, the
phrase implying that each term is of that weight.


3.3. Invariants of binary forms are isobaric. Let
$I(a_0, \dots, a_k)$ be an invariant of

\[ u = \sum_{r=0}^{k} \frac{k!}{r!\,(k-r)!}\,a_r\,x^{k-r}y^r \tag{3} \]

and let $I$ be a polynomial of degree $q$ in $a_0, \dots, a_k$. Let the
transformation $x = MX$ change $u$ into

\[ \sum_{r=0}^{k} \frac{k!}{r!\,(k-r)!}\,A_r\,X^{k-r}Y^r. \tag{4} \]

Then, by §3.1 (there being now two variables, so that $n = 2$),

\[ I(A_0, \dots, A_k) = |M|^{\frac12 kq}\,I(a_0, \dots, a_k), \tag{5} \]

whatever the matrix $M$ may be.

Consider the transformation

\[ x = X, \qquad y = \lambda Y, \]

for which $|M| = \lambda$. The values of $A_0, \dots, A_k$ are, for this particular
transformation,

\[ a_0, \quad a_1\lambda, \quad \dots, \quad a_r\lambda^r, \quad \dots, \quad a_k\lambda^k. \]

That is, the power of $\lambda$ associated with each coefficient $A$ is equal
to the weight of the coefficient. Thus, for this particular transformation,
the left-hand side of (5) is a polynomial in $\lambda$ such
that the coefficient of a power $\lambda^w$ is a function of the $a$'s of
weight $w$. But the right-hand side of (5) is merely

\[ \lambda^{\frac12 kq}\,I(a_0, \dots, a_k), \tag{6} \]

since, for this particular transformation, $|M| = \lambda$. Hence the
only power of $\lambda$ that can occur on the left-hand side of (5) is
$\lambda^{\frac12 kq}$ and its coefficient must be a function of the $a$'s of weight
$\frac12 kq$. Moreover, since the left-hand and the right-hand side of
(5) are identical, this coefficient of $\lambda^{\frac12 kq}$ on the left of (5) must
be $I(a_0, \dots, a_k)$.

Hence $I(a_0, \dots, a_k)$ is of weight $\frac12 kq$.

3.4. Notice that the weight of a polynomial in the $a$'s must
be an integer, so that we have, when $I(a_0, \dots, a_k)$ is a polynomial
invariant of a binary form,

\[ I(A_0, \dots, A_k) = |M|^s\,I(a_0, \dots, a_k), \tag{7} \]

\[ s = \tfrac12 kq, \]

and $\tfrac12 kq$ = weight of a polynomial = an integer.
Accordingly, the $s$ of (7) must be an integer.
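
The results of §§3.1-3.4 may be illustrated on the invariant (9) of §2.5, taken here in the form $(a_0a_3 - a_1a_2)^2 - 4(a_0a_2 - a_1^2)(a_1a_3 - a_2^2)$, which differs from (9) only by the numerical factor $-4$. For the cubic $k = 3$ and the invariant has degree $q = 4$, so that each term should have weight $\frac12 kq = 6$. The following sketch, with a dictionary representation of polynomials that is assumed purely for illustration, expands the expression and lists the degree and weight of every term.

```python
from collections import defaultdict

# A polynomial in a0..a3 is a dict {(e0, e1, e2, e3): integer coefficient}.

def add(p, q):
    out = defaultdict(int)
    for poly in (p, q):
        for mono, c in poly.items():
            out[mono] += c
    return {m: c for m, c in out.items() if c != 0}

def mul(p, q):
    out = defaultdict(int)
    for m1, c1 in p.items():
        for m2, c2 in q.items():
            out[tuple(e1 + e2 for e1, e2 in zip(m1, m2))] += c1 * c2
    return {m: c for m, c in out.items() if c != 0}

def scale(p, k):
    return {m: k * c for m, c in p.items() if k * c != 0}

def var(i):
    mono = [0, 0, 0, 0]
    mono[i] = 1
    return {tuple(mono): 1}

a0, a1, a2, a3 = (var(i) for i in range(4))

h0 = add(mul(a0, a2), scale(mul(a1, a1), -1))   # a0*a2 - a1^2
h1 = add(mul(a0, a3), scale(mul(a1, a2), -1))   # a0*a3 - a1*a2
h2 = add(mul(a1, a3), scale(mul(a2, a2), -1))   # a1*a3 - a2^2

delta = add(mul(h1, h1), scale(mul(h0, h2), -4))

# Degree = sum of exponents; weight = sum of (suffix * exponent).
degrees = {sum(mono) for mono in delta}
weights = {sum(i * e for i, e in enumerate(mono)) for mono in delta}

k, q = 3, 4
print(delta)
print(degrees, weights, k * q // 2)
```

Every term has degree 4 and weight 6, as the theory requires.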


3.5. Weight: forms in more than two variables. In the
form (§1.3)

\[ \sum \frac{k!}{\alpha!\,\beta!\cdots\lambda!}\,a_{\alpha\beta\ldots\lambda}\,x_1^{\alpha}x_2^{\beta}\cdots x_n^{\lambda} \tag{8} \]

the suffix $\lambda$ (corresponding to the power of $x_n$ in the term) is
called the weight of the coefficient $a_{\alpha\beta\ldots\lambda}$, and the weight of
a product of coefficients is defined to be the sum of the weights
of its constituent factors.

Let $I(a)$ denote a polynomial invariant of (8) of degree $q$ in
the $a$'s. Let the transformation $x = MX$ change (8) into

\[ \sum \frac{k!}{\alpha!\,\beta!\cdots\lambda!}\,A_{\alpha\beta\ldots\lambda}\,X_1^{\alpha}X_2^{\beta}\cdots X_n^{\lambda}. \tag{9} \]

Then, by §3.1,

\[ I(A) = |M|^{kq/n}\,I(a), \tag{10} \]

whatever the matrix $M$ may be.

Consider the particular transformation

\[ x_1 = X_1, \quad \dots, \quad x_{n-1} = X_{n-1}, \quad x_n = \lambda X_n, \]

for which $|M| = \lambda$. Then, by a repetition of the argument of
§3.3, we can prove that $I(a)$ must be isobaric and of weight
$kq/n$. Moreover, since the weight is necessarily an integer, $kq/n$
must be an integer.


Examples XIV b

Examples 2-4 are the extensions to covariants of results already
proved for invariants. The proofs of the latter need only slight
modification.

1. If $C(a, x)$ is a covariant of a given form $u$, and $C(a, x)$ is a sum of
algebraic forms of different degrees, say

\[ C(a, x) = C_1(a, x)^{m_1} + \dots + C_r(a, x)^{m_r}, \]

where the index $m$ denotes degree in the variables, then each separate
term $C(a, x)^m$ is itself a covariant.

Hint. Since $C(a, x)$ is a covariant,

\[ C(A, X) = |M|^s\,C(a, x). \]

Put $x_1t, \dots, x_nt$ for $x_1, \dots, x_n$ and, consequently, $X_1t, \dots, X_nt$ for $X_1, \dots, X_n$.
The result is

\[ \sum_r C_r(A, X)^{m_r}\,t^{m_r} = |M|^s \sum_r C_r(a, x)^{m_r}\,t^{m_r}. \]

Each side is a polynomial in $t$ and we may equate coefficients of like
powers of $t$.

2. A covariant of degree $m$ in the variables is homogeneous in the
coefficients.

Hint. The particular transformation

\[ x_1 = \lambda X_1, \quad \dots, \quad x_n = \lambda X_n \]

gives, as in §3.1,

\[ C(a\lambda^k, X)^m = \lambda^{ns}\,C(a, x)^m, \]

i.e.

\[ \lambda^{-m}\,C(a\lambda^k, x)^m = \lambda^{ns}\,C(a, x)^m. \]

This proves the required result and, further, shows that, if $C$ is of degree
$q$ in the coefficients $a$,

\[ \lambda^{kq} = \lambda^{ns}\lambda^{m}, \]

or $kq = ns + m$.

3. If in a covariant of a binary form in $x$, $y$ we consider $x$ to have
weight unity and $y$ to have weight zero ($x^2$ of weight 2, etc.), then a
covariant $C(a, x)^m$, of degree $q$ in the coefficients $a$, is isobaric and of
weight $\frac12(kq + m)$.

Hint. Consider the particular transformation $x = X$, $y = \lambda Y$ and
follow the line of argument of §3.3.

4. If in a covariant of a form in $x_1, \dots, x_n$ we consider $x_1, \dots, x_{n-1}$ to
have unit weight and $x_n$ to have zero weight ($x_1x_2x_n$ of weight 2, etc.),
then a covariant $C(a, x)^m$, of degree $q$ in the coefficients $a$, is isobaric
and of weight $\{kq + (n-1)m\}/n$.

Hint. Compare §3.5.

3.6. Invariants of a covariant. Let $u(a, x)^k$ be a given
form of degree $k$ in the $n$ variables $x_1, \dots, x_n$. Let $C(a, x)^m$ be
a covariant of $u$, of degree $m$ in the variables and of degree $q$
in the coefficients $a$. The coefficients in $C(a, x)^m$ are not the
actual $a$'s of $u$ but homogeneous functions of them.

The transformation $x = MX$ changes $u(a, x)$ into $u(A, X)$
and, since $C$ is a covariant, there is an $s$ such that

\[ C(A, X)^m = |M|^s\,C(a, x)^m. \]

Thus the transformation changes $C(a, x)^m$ into $|M|^{-s}C(A, X)^m$.
But $C$ is a homogeneous function of degree $q$ in the $a$'s, and
so also in the $A$'s. Hence

\[ C(a, x)^m = |M|^{-s}\,C(A, X)^m = C(A|M|^{-s/q}, X)^m. \tag{11} \]

Now suppose that $I(b)$ is an invariant of $v(b, x)$, a form of
degree $m$ in $n$ variables. That is, if the coefficients are indicated
by $B$ when $v$ is expressed in terms of $X$, there is a $t$ for
which

\[ I(B) = |M|^t\,I(b). \tag{12} \]

Take $v(b, x)$ to be the covariant $C(a, x)^m$; then (11) shows
that the corresponding $B$ are the coefficients of the terms in
$C(A|M|^{-s/q}, X)$.

If $I(b)$ is of degree $r$ in the $b$, then when expressed in terms
of $a$ it is $I_1(a)$, a function homogeneous and of degree $rq$ in the
coefficients $a$. Moreover, (12) gives

\[ I_1(A|M|^{-s/q}) = |M|^t\,I_1(a), \]

or, on using the fact that $I_1$ is homogeneous and of degree $rq$
in the $a$,

\[ |M|^{-rs}\,I_1(A) = |M|^t\,I_1(a). \]

That is, $I_1(a)$ is an invariant of the original form $u(a, x)$.

The generalities of the foregoing work are not easy to follow.
The reader should study them in relation to the particular
example which we now give; it is taken from §2.5.

\[ u(a, x)^3 = a_0x^3 + 3a_1x^2y + 3a_2xy^2 + a_3y^3 \]

has a covariant

\[ C(a, x)^2 = (a_0a_2 - a_1^2)x^2 + (a_0a_3 - a_1a_2)xy + (a_1a_3 - a_2^2)y^2. \]

Being the Hessian of $u$, this covariant has a multiplying factor
$|M|^2$; that is to say,

\[ (A_0A_2 - A_1^2)X^2 + \dots = |M|^2\{(a_0a_2 - a_1^2)x^2 + \dots\}, \tag{11a} \]

which corresponds to (11) above.

But if $x = MX$ changes the quadratic form

\[ b_0x^2 + 2b_1xy + b_2y^2 \]

into

\[ B_0X^2 + 2B_1XY + B_2Y^2, \]

then

\[ B_0B_2 - B_1^2 = |M|^2(b_0b_2 - b_1^2). \tag{12a} \]

Take for the $b$'s the coefficients of (11a) and we see that

\[
\begin{aligned}
&|M|^{-4}\{(A_0A_2 - A_1^2)(A_1A_3 - A_2^2) - \tfrac14(A_0A_3 - A_1A_2)^2\}\\
&\qquad = |M|^2\{(a_0a_2 - a_1^2)(a_1a_3 - a_2^2) - \tfrac14(a_0a_3 - a_1a_2)^2\},
\end{aligned}
\]

which proves that (9) of §2.5 is an invariant of the cubic
$a_0x^3 + 3a_1x^2y + 3a_2xy^2 + a_3y^3$.
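
The final displayed relation of §3.6 asserts that the expression (9) is multiplied by $|M|^6$ when the cubic is transformed. A numerical check is sketched below; it is an illustration under stated assumptions rather than part of the proof. A binary form $\sum \binom{k}{r} a_r x^{k-r}y^r$ is stored as the list of its $a_r$, the transformation is $x = l_1X + m_1Y$, $y = l_2X + m_2Y$, the invariant is taken as $-4$ times (9) so as to avoid fractions, and all function names are hypothetical.

```python
from fractions import Fraction
from math import comb

def poly_mul(p, q):
    """Product of two polynomials given as coefficient lists."""
    out = [0] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out

def poly_pow(p, n):
    out = [1]
    for _ in range(n):
        out = poly_mul(out, p)
    return out

def transform(a, l1, m1, l2, m2):
    """Coefficients A_r of sum C(k,r) a_r x^(k-r) y^r after the
    substitution x = l1 X + m1 Y, y = l2 X + m2 Y."""
    k = len(a) - 1
    x_lin, y_lin = [l1, m1], [l2, m2]   # list index = power of Y
    plain = [0] * (k + 1)               # plain[j] multiplies X^(k-j) Y^j
    for r, a_r in enumerate(a):
        term = poly_mul(poly_pow(x_lin, k - r), poly_pow(y_lin, r))
        for j, c in enumerate(term):
            plain[j] += comb(k, r) * a_r * c
    # Remove the binomial coefficients again to recover the A_r.
    return [Fraction(c, comb(k, j)) for j, c in enumerate(plain)]

def disc(a):
    """(a0a3 - a1a2)^2 - 4(a0a2 - a1^2)(a1a3 - a2^2), i.e. -4 times (9)."""
    a0, a1, a2, a3 = a
    return (a0 * a3 - a1 * a2) ** 2 - 4 * (a0 * a2 - a1 ** 2) * (a1 * a3 - a2 ** 2)

a = [1, 2, -1, 3]                 # x^3 + 6x^2y - 3xy^2 + 3y^3
l1, m1, l2, m2 = 2, 1, 1, 3
det = l1 * m2 - l2 * m1           # |M| = 5
A = transform(a, l1, m1, l2, m2)

print(disc(A), det ** 6 * disc(a))

# A case that can be checked by hand: x^3 + y^3 with x = X + Y, y = X - Y
# becomes 2X^3 + 6XY^2, so that (A_0, A_1, A_2, A_3) = (2, 0, 2, 0).
simple = transform([1, 0, 0, 1], 1, 1, 1, -1)
print(simple)
```

Exact rational arithmetic is used so that the two sides can be compared for equality and not merely to within rounding error.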

3.7. Covariants of a covariant. A slight extension of the
argument of §3.6, using a covariant $C(b, x)^{m'}$ of a form of degree
$m$ in $n$ variables where §3.6 uses $I(b)$, will prove that $C(b, x)^{m'}$
gives rise to a covariant of the original form $u(a, x)$.

3.8. Irreducible invariants and covariants. If $I(a)$ is an
invariant of a form $u(a, x)^k$, it is immediately obvious, from the
definition of an invariant, that the square, cube, ... of $I(a)$ are
also invariants of $u$. Thus there is an unlimited number of
invariants; and so for covariants.

On the other hand, there is, for a given form $u$, only a finite
number of invariants which cannot be expressed rationally and
integrally in terms of invariants of equal or lower degree.
Equally, there is only a finite number of covariants which cannot
be expressed rationally and integrally in terms of invariants
and covariants of equal or lower degree in the coefficients of $u$.
Invariants or covariants which cannot be so expressed are called
irreducible. The theorem may then be stated

‘The number of irreducible covariants and invariants of a given
form is finite.’

This theorem and its extension to the joint covariants and
invariants of any given system of forms is sometimes called the
Gordan-Hilbert theorem. Its proof is beyond the scope of the
present book.

Even among the irreducible covariants and invariants there
may be an algebraical relation of such a kind that no one of
its terms can be expressed as a rational and integral function
of the rest. For example, a cubic $u$ has three irreducible covariants,
itself, a covariant $G$, and a covariant $H$; it has one
irreducible invariant $\Delta$: there is a relation connecting these
four, namely,

\[ \Delta u^2 = G^2 + 4H^3. \]

We shall prove this relation in §4.3.


4. Canonical forms 

4.1. Binary cubics. Readers will be familiar with the device
of considering $ax^2 + 2hxy + by^2$ in the form $aX^2 + b_1Y^2$, obtained
by writing

\[ ax^2 + 2hxy + by^2 = a\Bigl(x + \frac{h}{a}y\Bigr)^2 + \Bigl(b - \frac{h^2}{a}\Bigr)y^2 \]

and making the substitution $X = x + (h/a)y$, $Y = y$. The
general quadratic form was considered in Theorem 47, p. 148.
We now show that the cubic

\[ a_0x^3 + 3a_1x^2y + 3a_2xy^2 + a_3y^3 \tag{1} \]

may, in general, be written as $pX^3 + qY^3$; the exceptions are
cubics that contain a squared factor.

There are several ways of proving this: the method that
follows is, perhaps, the most direct. We pose the problem

‘Is it possible to find $p$, $q$, $\alpha$, $\beta$ so that (1) is identically
equal to

\[ p(x + \alpha y)^3 + q(x + \beta y)^3?\text{’} \tag{2} \]

It is possible if, having chosen $\alpha$ and $\beta$, we can then choose
$p$ and $q$ to satisfy the four equations

\[
\begin{aligned}
p + q &= a_0, & p\alpha + q\beta &= a_1,\\
p\alpha^2 + q\beta^2 &= a_2, & p\alpha^3 + q\beta^3 &= a_3.
\end{aligned}
\tag{3}
\]

For general values of $a_0, a_1, a_2, a_3$ these four equations cannot
be consistent if $\alpha = \beta$. We shall proceed, at first, on the
assumption $\alpha \neq \beta$.

The third equation of (3) follows from the first two if we can
choose $P$, $Q$ so that, simultaneously,

\[ a_2 + Pa_1 + Qa_0 = 0, \qquad \alpha^2 + P\alpha + Q = 0, \qquad \beta^2 + P\beta + Q = 0. \]

The fourth equation of (3) follows from the second and third
if $P$, $Q$ also satisfy

\[ a_3 + Pa_2 + Qa_1 = 0, \qquad \alpha(\alpha^2 + P\alpha + Q) = 0, \qquad \beta(\beta^2 + P\beta + Q) = 0. \]

That is, the third and fourth equations are linear consequences
of the first two if $\alpha$, $\beta$ are the roots of

\[ t^2 + Pt + Q = 0, \]

where $P$, $Q$ are determined from the two equations

\[
\begin{aligned}
a_2 + Pa_1 + Qa_0 &= 0,\\
a_3 + Pa_2 + Qa_1 &= 0.
\end{aligned}
\]

This means that $\alpha$, $\beta$ are the roots of

\[ (a_0a_2 - a_1^2)t^2 + (a_1a_2 - a_0a_3)t + (a_1a_3 - a_2^2) = 0 \]

and, provided they are distinct, $p$, $q$ can then be determined
from the first two equations of (3).

Thus (1) can be expressed in the form (2) provided that

\[ (a_1a_2 - a_0a_3)^2 - 4(a_0a_2 - a_1^2)(a_1a_3 - a_2^2) \tag{4} \]

is not zero, this condition being necessary to ensure that $\alpha$ is
not equal to $\beta$.

We may readily see what cubics are excluded by the provision
that (4) is not zero. If (4) is zero, the two quadratics

\[
\begin{aligned}
&a_0x^2 + 2a_1xy + a_2y^2,\\
&a_1x^2 + 2a_2xy + a_3y^2
\end{aligned}
\]

have a common linear factor;† hence

\[ a_0x^2 + 2a_1xy + a_2y^2, \qquad a_0x^3 + 3a_1x^2y + 3a_2xy^2 + a_3y^3 \]

have a common linear factor, and so the latter has a repeated
linear factor. Such a cubic may be written as $X^2Y$, where $X$
and $Y$ are linear in $x$ and $y$.

† The reader will more readily recognize the argument in the form

\[ a_0x^2 + 2a_1x + a_2 = 0, \qquad a_1x^2 + 2a_2x + a_3 = 0 \]

have a common root; hence

\[ a_0x^2 + 2a_1x + a_2 = 0, \qquad a_0x^3 + 3a_1x^2 + 3a_2x + a_3 = 0 \]

have a common root, and so the latter has a repeated root. Every root common
to $F(x) = 0$ and $F'(x) = 0$ is a repeated root of the former.
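
The construction of §4.1 is easily carried out numerically. The sketch below is an illustration only and makes the assumptions stated in its comments: it solves the quadratic for $\alpha$, $\beta$ with complex arithmetic, finds $p$, $q$ from the first two equations of (3), and leaves the third and fourth equations to be checked afterwards. The function name is hypothetical, and the case $a_0a_2 - a_1^2 = 0$, in which the quadratic loses its leading term, is deliberately not handled.

```python
import cmath

def canonical_cubic(a0, a1, a2, a3):
    """Return (p, q, alpha, beta) with
    a0 x^3 + 3 a1 x^2 y + 3 a2 x y^2 + a3 y^3
        = p (x + alpha y)^3 + q (x + beta y)^3.
    Assumes a0*a2 - a1^2 != 0 and that expression (4) is not zero."""
    A = a0 * a2 - a1 * a1
    B = a1 * a2 - a0 * a3
    C = a1 * a3 - a2 * a2
    disc = B * B - 4 * A * C          # expression (4)
    if A == 0 or disc == 0:
        raise ValueError("case not covered by this sketch")
    root = cmath.sqrt(disc)
    alpha = (-B + root) / (2 * A)
    beta = (-B - root) / (2 * A)
    # First two equations of (3): p + q = a0, p*alpha + q*beta = a1.
    p = (a1 - a0 * beta) / (alpha - beta)
    q = a0 - p
    return p, q, alpha, beta

# The cubic (x + y)^3 + 2(x - y)^3 = 3x^3 - 3x^2y + 9xy^2 - y^3,
# i.e. (a0, a1, a2, a3) = (3, -1, 3, -1).
coeffs = (3, -1, 3, -1)
p, q, alpha, beta = canonical_cubic(*coeffs)

# Residuals of the third and fourth equations of (3).
res2 = p * alpha ** 2 + q * beta ** 2 - coeffs[2]
res3 = p * alpha ** 3 + q * beta ** 3 - coeffs[3]
print(p, q, alpha, beta, abs(res2), abs(res3))
```

For this cubic the quadratic reduces to $8t^2 - 8 = 0$, giving $\alpha = 1$, $\beta = -1$, $p = 1$, $q = 2$.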
4.2. Binary quartics. The quartic

\[ (a_0, \dots, a_4)(x, y)^4 \tag{5} \]

has four linear factors. By combining these in pairs we may
write (5) as the product of two quadratic factors, say

\[ a'x^2 + 2h'xy + b'y^2, \qquad a''x^2 + 2h''xy + b''y^2. \tag{6} \]

Let us, at first, preclude quartics that have a squared factor,
linear in $x$ and $y$. Then, by a linear transformation, not necessarily
real, we may write the factors (6) in the forms†

\[ X^2 + Y^2, \qquad \alpha X^2 + 2\beta XY + \gamma Y^2. \]

By applying Theorem 48, without insisting on the transformation
being real, we can write these in the forms

\[ X_1^2 + X_2^2, \qquad \lambda_1X_1^2 + \lambda_2X_2^2. \tag{7} \]

Moreover, since we are precluding quartics that have a squared
linear factor, neither $\lambda_1$ nor $\lambda_2$ is zero, and the quartic may be
written as

\[ \lambda_1X_1^4 + (\lambda_1 + \lambda_2)X_1^2X_2^2 + \lambda_2X_2^4, \]

or, on making a final substitution

\[ X = \lambda_1^{1/4}X_1, \qquad Y = \lambda_2^{1/4}X_2, \]

we may write the quartic as

\[ X^4 + 6mX^2Y^2 + Y^4. \tag{8} \]

This is the canonical form for the general quartic. A quartic
having a squared factor, hitherto excluded from our discussion,
may be written as $X^2(\lambda_1X^2 + \lambda_2Y^2)$.


† We have merely to identify the two distinct factors of $a'x^2 + 2h'xy + b'y^2$
with $X + iY$ and $X - iY$.

4.3. Application of canonical forms. Relations among
the invariants and covariants of a form are fairly easy to detect
when the canonical form is considered. For example, the cubic

\[ u = a_0x^3 + 3a_1x^2y + 3a_2xy^2 + a_3y^3 \]

is known (§2.5) to have

a covariant $H = (a_0a_2 - a_1^2)x^2 + \dots$ [(8) of §2.5],

a covariant $G = (a_0^2a_3 - 3a_0a_1a_2 + 2a_1^3)x^3 + \dots$ [(10) of §2.5],

and an invariant

\[ \Delta = (a_0a_3 - a_1a_2)^2 - 4(a_0a_2 - a_1^2)(a_1a_3 - a_2^2). \]

Unless $\Delta = 0$ the cubic can, by the transformation $X = x + \alpha y$,
$Y = x + \beta y$ of §4.1, be expressed as

\[ u = pX^3 + qY^3. \]

In these variables the covariants $H$ and $G$ become

\[ H_1 = pqXY, \qquad G_1 = p^2qX^3 - pq^2Y^3, \]

while $\Delta_1 = p^2q^2$.

It is at once obvious that

\[ \Delta_1u^2 = G_1^2 + 4H_1^3. \]

Now, $|M|$ being the modulus of the transformation $x = MX$,

\[ H_1 = |M|^2H, \qquad G_1 = |M|^3G, \qquad \Delta_1 = |M|^6\Delta. \]

The factor of $H$ is $|M|^2$ because $H$ is a Hessian. The factor $|M|^3$ of $G$
is most easily determined by considering the transformation $x = \lambda X$,
$y = \lambda Y$, which gives $A_0 = a_0\lambda^3, \dots, A_3 = a_3\lambda^3$, so that

\[
\begin{aligned}
(A_0^2A_3 - 3A_0A_1A_2 + 2A_1^3)X^3 + \dots
&= \lambda^9\{(a_0^2a_3 - 3a_0a_1a_2 + 2a_1^3)x^3\lambda^{-3} + \dots\}\\
&= \lambda^6\{(a_0^2a_3 - 3a_0a_1a_2 + 2a_1^3)x^3 + \dots\},
\end{aligned}
\]

and the multiplying factor is $\lambda^6$, or $|M|^3$.

The factor $|M|^6$ of $\Delta$ may be determined in the same way.

Accordingly, we have proved that, unless $\Delta = 0$,

\[ \Delta u^2 = G^2 + 4H^3. \tag{9} \]

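If $\Delta = 0$, the cubic can be written as $X^2Y$, a form for which
$\Delta$ and $G^2 + 4H^3$ are both zero (for $X^2Y$ itself, $H = -\tfrac19X^2$ and
$G = \tfrac{2}{27}X^3$), so that (9) holds in this case also.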
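
The relation (9) is a polynomial identity in $a_0, \dots, a_3$, $x$, $y$, and may be tested with integer arithmetic. The sketch below evaluates $u$, $H$, $G$, and $\Delta$ from the explicit expressions (7), (8), (10) of §2.5 and the definition of $\Delta$ in §4.3, for a few arbitrarily chosen cubics and points; the function names and the test values are assumptions made for the illustration.

```python
def cubic(a, x, y):
    a0, a1, a2, a3 = a
    return a0 * x**3 + 3 * a1 * x * x * y + 3 * a2 * x * y * y + a3 * y**3

def hessian(a, x, y):
    """The covariant (8) of section 2.5."""
    a0, a1, a2, a3 = a
    h0 = a0 * a2 - a1 * a1
    h1 = a0 * a3 - a1 * a2
    h2 = a1 * a3 - a2 * a2
    return h0 * x * x + h1 * x * y + h2 * y * y

def cubic_covariant(a, x, y):
    """The covariant (10) of section 2.5."""
    a0, a1, a2, a3 = a
    g0 = a0 * a0 * a3 - 3 * a0 * a1 * a2 + 2 * a1**3
    g1 = 3 * (a0 * a1 * a3 - 2 * a0 * a2 * a2 + a1 * a1 * a2)
    g2 = 3 * (2 * a1 * a1 * a3 - a0 * a2 * a3 - a1 * a2 * a2)
    g3 = 3 * a1 * a2 * a3 - a0 * a3 * a3 - 2 * a2**3
    return g0 * x**3 + g1 * x * x * y + g2 * x * y * y + g3 * y**3

def delta(a):
    a0, a1, a2, a3 = a
    return (a0 * a3 - a1 * a2) ** 2 - 4 * (a0 * a2 - a1 * a1) * (a1 * a3 - a2 * a2)

def syzygy_holds(a, x, y):
    """True when Delta * u^2 = G^2 + 4 H^3 at the point (x, y)."""
    lhs = delta(a) * cubic(a, x, y) ** 2
    rhs = cubic_covariant(a, x, y) ** 2 + 4 * hessian(a, x, y) ** 3
    return lhs == rhs

samples = [(1, 0, 0, 1), (1, 0, 1, 0), (1, 2, -1, 3), (2, -3, 5, 7)]
points = [(x, y) for x in range(-3, 4) for y in range(-3, 4)]
all_ok = all(syzygy_holds(a, x, y) for a in samples for x, y in points)
print(all_ok)
```

Since both sides are binary forms of degree 6 in $x$, $y$, agreement at 49 points for a given cubic is more than enough to establish the identity for that cubic.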

5. Geometrical invariants 

5.1. Projective invariants. Let a point $P$, having homogeneous
coordinates $x$, $y$, $z$ (say areal, or trilinear, to be definite)
referred to a triangle $ABC$ in a plane $\pi$, be projected into the
point $P'$ in the plane $\pi'$. Let $A'B'C'$ be the projection on $\pi'$ of
$ABC$ and let $x'$, $y'$, $z'$ be the homogeneous coordinates of $P'$
referred to $A'B'C'$. Then, as many books on analytical geometry
prove (in one form or another), there are constants $l$, $m$, $n$
such that

\[ x = lx', \qquad y = my', \qquad z = nz'. \]

Let $X$, $Y$, $Z$ be the homogeneous coordinates of $P'$ referred
to a triangle, in $\pi'$, other than $A'B'C'$. Then there are relations

\[
\begin{aligned}
x' &= \lambda_1X + \mu_1Y + \nu_1Z,\\
y' &= \lambda_2X + \mu_2Y + \nu_2Z,\\
z' &= \lambda_3X + \mu_3Y + \nu_3Z.
\end{aligned}
\]

Thus the projection of a figure in $\pi$ on to a plane $\pi'$ gives
rise to a transformation of the type

\[
\begin{aligned}
x &= l_1X + m_1Y + n_1Z,\\
y &= l_2X + m_2Y + n_2Z,\\
z &= l_3X + m_3Y + n_3Z,
\end{aligned}
\tag{1}
\]

wherein, since $x = 0$, $y = 0$, $z = 0$ are not concurrent lines,
the determinant $(l_1m_2n_3)$ is not zero.

Thus projection leads to the type of transformation we have
been considering in the earlier sections of the chapter. Geometrical
properties of figures that are unaltered by projection
(projective properties) may be expected to correspond to invariants
or covariants of algebraic forms and, conversely, any
invariant or covariant of algebraic forms may be expected to
correspond to some projective property of a geometrical figure.
The binary transformation

\[ x = l_1X + m_1Y, \qquad y = l_2X + m_2Y, \tag{2} \]

may be considered as the form taken by (1) when only lines
through the vertex $C$ of the original triangle of reference are
in question; for such an equation as $ax^2 + 2hxy + by^2 = 0$ corresponds
to a pair of lines through $C$; it becomes

\[ AX^2 + 2HXY + BY^2 = 0, \]

say, after transformation by (2), which is a pair of lines through
a vertex of the triangle of reference in the projected figure.

We shall not attempt any systematic development of the
geometrical approach to invariant theory; we give merely a few
isolated examples.

The cross-ratio of a pencil of four lines is unaltered by projection:
the condition that the two pairs of lines

\[ ax^2 + 2hxy + by^2 = 0, \qquad a'x^2 + 2h'xy + b'y^2 = 0 \]

should form a harmonic pencil is $ab' + a'b - 2hh' = 0$: this represents
a property that is unaltered by projection and, as we
should expect, $ab' + a'b - 2hh'$ is an invariant of the two algebraic
forms. Again, we may expect the cross-ratio of the four
lines

\[ ax^4 + 4bx^3y + 6cx^2y^2 + 4dxy^3 + ey^4 = 0 \]

to indicate some of the invariants of the quartic. The condition
that the four lines form a harmonic pencil is

\[ J \equiv ace + 2bcd - ad^2 - b^2e - c^3 = 0 \]

and, as we have seen (Example 15, p. 181), $J$ is an invariant
of the quartic. The condition that the four lines form an equiharmonic
pencil, i.e. that the first and fourth of the cross-ratios

\[ \rho, \quad \frac{1}{\rho}, \quad 1 - \rho, \quad \frac{1}{1 - \rho}, \quad \frac{\rho - 1}{\rho}, \quad \frac{\rho}{\rho - 1} \]

(the six values arising from the permutations of the order of the
lines) are equal, is

\[ I \equiv ae - 4bd + 3c^2 = 0. \]

Moreover, $I$ is an invariant of the quartic.

Again, in geometry, the Hessian of a given curve is the locus 
of a point whose polar conic with respect to the curve is a pair 
of lines: the locus meets the curve only at the inflexions and 
multiple points of the curve. Now proper conics project into 
proper conics, line-pairs into line-pairs, points of inflexion into 
points of inflexion, and multiple points into multiple points. 
Thus all the geometrical marks of a Hessian are unaltered by
projection and, as we should expect in such circumstances, the
Hessian proves to be a covariant of an algebraic form. Similarly,
the Jacobian of three curves is the locus of a point whose
polar lines with respect to the three curves are concurrent; thus
its geometrical definition turns upon projective properties of
the curves and, as we should expect, the Jacobian of any three
algebraic forms is a covariant.

In view of their close connexion with the geometry of projection,
the invariants and covariants we have hitherto been
considering are sometimes called projective invariants and covariants.
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5.2. Metrical invariants. The orthogonal transformation 


x = liX+niiY+^Z, \ 
y = l 2 X+m 2 Y+n 2 Z, } 


( 3 ) 


z = l 3 X+m 3 Y+n 3 Z, 

wherein (l v m v n l ) 9 (l 2 , m 2) n 2 ), and (l 3 ,m 3 ,n 3 ) are thje direction- 
cosines of three mutually perpendicular lines, is au particular 
type of linear transformation. It leaves unchanged certain 
functions of the coefficients of \ 

a# 2 + 6?/ 2 -f cz 2 + 2 fyz + 2 gzx + 2 hxy ( 4 ) 


which are not invariant under the general linear transformation. 
If (3) transforms (4) into 


$$a_1X^2 + b_1Y^2 + c_1Z^2 + 2f_1YZ + 2g_1ZX + 2h_1XY, \tag{5}$$

we have, since (3) also transforms $x^2+y^2+z^2$ into $X^2+Y^2+Z^2$,
the values of $\lambda$ for which (4) $-\,\lambda(x^2+y^2+z^2)$ is a pair of linear
factors are the values of $\lambda$ for which (5) $-\,\lambda(X^2+Y^2+Z^2)$ is
a pair of linear factors. These values of $\lambda$ are given by

$$
\begin{vmatrix} a-\lambda & h & g \\ h & b-\lambda & f \\ g & f & c-\lambda \end{vmatrix} = 0,
\qquad
\begin{vmatrix} a_1-\lambda & h_1 & g_1 \\ h_1 & b_1-\lambda & f_1 \\ g_1 & f_1 & c_1-\lambda \end{vmatrix} = 0,
$$

respectively. By equating the coefficients of powers of $\lambda$ in the
two equations we obtain

$$a+b+c = a_1+b_1+c_1,$$

$$A+B+C = A_1+B_1+C_1 \qquad (A = bc - f^2, \text{ etc.}),$$

$$\Delta = \Delta_1,$$

where $\Delta$ is the discriminant of the form (4). That is to say,
$a+b+c$ and $bc+ca+ab-f^2-g^2-h^2$ are invariant under orthogonal
transformation; they are not invariant under the general linear
transformation. On the other hand, the discriminant $\Delta$ is an
invariant of the form for all linear transformations.
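
The invariance of these functions under an orthogonal transformation can be checked numerically. The following is only a sketch in Python: the coefficients, the two rotation angles, and the helper names (`metrical_invariants`, `transform`) are illustrative assumptions, not part of the text. It forms the matrix of the transformed form as M'SM and compares the three functions before and after.

```python
import math

def matmul(P, Q):
    """Product of two matrices given as lists of rows."""
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

def transpose(P):
    return [list(col) for col in zip(*P)]

def det3(S):
    return (S[0][0] * (S[1][1] * S[2][2] - S[1][2] * S[2][1])
            - S[0][1] * (S[1][0] * S[2][2] - S[1][2] * S[2][0])
            + S[0][2] * (S[1][0] * S[2][1] - S[1][1] * S[2][0]))

def metrical_invariants(S):
    """Return (a+b+c, A+B+C, discriminant) for S = [[a,h,g],[h,b,f],[g,f,c]]."""
    trace = S[0][0] + S[1][1] + S[2][2]
    minors = (S[1][1] * S[2][2] - S[1][2] ** 2      # A = bc - f^2
              + S[2][2] * S[0][0] - S[0][2] ** 2    # B = ca - g^2
              + S[0][0] * S[1][1] - S[0][1] ** 2)   # C = ab - h^2
    return trace, minors, det3(S)

def transform(S, M):
    """Matrix of the form after the substitution x = MX, namely M'SM."""
    return matmul(transpose(M), matmul(S, M))

# Hypothetical coefficients: a=1, b=4, c=6, f=5, g=3, h=2.
S = [[1, 2, 3], [2, 4, 5], [3, 5, 6]]

# An orthogonal matrix built from two rotations (angles chosen arbitrarily).
t, p = 0.7, 1.1
Rz = [[math.cos(t), -math.sin(t), 0], [math.sin(t), math.cos(t), 0], [0, 0, 1]]
Rx = [[1, 0, 0], [0, math.cos(p), -math.sin(p)], [0, math.sin(p), math.cos(p)]]
orthogonal = matmul(Rz, Rx)

# A non-singular transformation that is not orthogonal.
stretch = [[2, 0, 0], [0, 1, 0], [0, 0, 1]]

before = metrical_invariants(S)
after_rotation = metrical_invariants(transform(S, orthogonal))
after_stretch = metrical_invariants(transform(S, stretch))
print(before, after_rotation, after_stretch)
```

Under the rotation all three values agree with the originals to rounding error; under the stretch the first two change, and the discriminant is multiplied by the square of the determinant of the transformation, as befits an invariant of the general linear transformation.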

The orthogonal transformation (3) is equivalent to a change 
from one set of rectangular axes of reference to another set of 
rectangular axes. It leaves unaltered all properties of a geometrical
figure that depend solely on measurement. For
example,

$$\frac{\lambda\lambda' + \mu\mu' + \nu\nu'}{\sqrt{\{(\lambda^2+\mu^2+\nu^2)(\lambda'^2+\mu'^2+\nu'^2)\}}} \tag{6}$$

is an invariant of the linear forms

$$\lambda x + \mu y + \nu z, \qquad \lambda' x + \mu' y + \nu' z$$

under any orthogonal transformation: for (6) is the cosine of
the angle between the two planes. Such examples have but
little algebraic interest.


6. Arithmetic and other invariants 

6.1. There are certain integers associated with algebraic
forms (or curves) that are obviously unchanged when the
variables undergo the transformation x = MX. Such are the
degree of the curve, the number of intersections of a curve with
its Hessian, and so on. One of the less obvious† of these arithmetic
invariants, as they are called, is the rank of the matrix
formed by the coordinates of a number of points.

Let the components of x be $x, y, \ldots, z$, $n$ in number. Let $m$
points (or particular values of x) be given; say

$$(x_1, \ldots, z_1),\ \ldots,\ (x_m, \ldots, z_m).$$

Then the rank of the matrix

$$\begin{bmatrix} x_1 & \cdots & z_1 \\ \vdots & & \vdots \\ x_m & \cdots & z_m \end{bmatrix}$$

is unaltered by any non-singular linear transformation

x = MX.

Suppose the matrix is of rank $r$ ($< m$). Then (Theorem 31)
we can choose $r$ rows and express the others as sums of multiples
of these $r$ rows. For convenience of writing, suppose all rows
after the $r$th can be expressed as sums of multiples of the first
$r$ rows. Then there are constants $\lambda_{sk}$ such that, for any
letter $t$ of $x, \ldots, z$,

$$t_{r+k} = \lambda_{1k}t_1 + \ldots + \lambda_{rk}t_r. \tag{1}$$
† This also is obvious to anyone with some knowledge of n-dimensional
geometry.
The transformation x = MX has an inverse X = $M^{-1}$x or,
in full, say

$$X = c_{11}x + \ldots + c_{1n}z,\quad \ldots,\quad Z = c_{n1}x + \ldots + c_{nn}z.$$

Putting in suffixes for particular points
and supposing that $t$ is the $j$th letter of $x, y, \ldots, z$, we have

$$T_{r+k} - \sum_{s=1}^{r}\lambda_{sk}T_s = (c_{j1}x_{r+k} + \ldots + c_{jn}z_{r+k}) - \sum_{s=1}^{r}\lambda_{sk}(c_{j1}x_s + \ldots + c_{jn}z_s).$$

By (1), the coefficient of each of $c_{j1}, \ldots, c_{jn}$ is zero and hence
(1) implies

$$T_{r+k} = \lambda_{1k}T_1 + \ldots + \lambda_{rk}T_r. \tag{2}$$


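Conversely, as we see by using x = MX where in the foregoing
we have used X = $M^{-1}$x, (2) implies (1). By Theorems 31 and
32, it follows that the ranks of the two matrices

$$\begin{bmatrix} x_1 & \cdots & z_1 \\ \vdots & & \vdots \\ x_m & \cdots & z_m \end{bmatrix}, \qquad \begin{bmatrix} X_1 & \cdots & Z_1 \\ \vdots & & \vdots \\ X_m & \cdots & Z_m \end{bmatrix}$$

are equal.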
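
As a numerical illustration of this arithmetic invariant, the sketch below computes the rank of a matrix of point coordinates before and after a non-singular transformation, using exact rational arithmetic. The points, the matrix `C` (playing the part of the inverse matrix in X = Cx), and the function names are assumptions chosen only for the example.

```python
from fractions import Fraction

def rank(rows):
    """Rank of a matrix (list of rows) by Gaussian elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in rows]
    rk = 0
    ncols = len(m[0]) if m else 0
    for col in range(ncols):
        # Look for a non-zero pivot in this column among the unused rows.
        pivot = next((i for i in range(rk, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for i in range(rk + 1, len(m)):
            factor = m[i][col] / m[rk][col]
            m[i] = [a - factor * b for a, b in zip(m[i], m[rk])]
        rk += 1
    return rk

def transform_points(points, C):
    """Apply X = Cx to every point (each point is one row of coordinates)."""
    return [[sum(C[i][j] * x[j] for j in range(len(x))) for i in range(len(C))]
            for x in points]

# Four points in three coordinates; rows 2 and 4 depend on rows 1 and 3.
points = [[1, 2, 3], [2, 4, 6], [1, 0, 1], [2, 2, 4]]

# A non-singular matrix (its determinant is 3).
C = [[1, 1, 0], [0, 1, 1], [1, 0, 2]]

new_points = transform_points(points, C)
print(rank(points), rank(new_points))
```

Both ranks come out equal, in agreement with the argument above: a linear relation between the rows survives the transformation with the same multipliers.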


6.2. Transvectants. We conclude with an application to 
binary forms of invariants which are derived from the con- 
sideration of particular values of the variables. 

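Let $(x_1, y_1)$, $(x_2, y_2)$ be two cogredient pairs of variables $(x, y)$
each subject to the transformation

$$x = lX + mY, \qquad y = l'X + m'Y, \tag{3}$$

wherein $|M| = lm' - l'm$. Then

$$\frac{\partial}{\partial X} = l\frac{\partial}{\partial x} + l'\frac{\partial}{\partial y}, \qquad \frac{\partial}{\partial Y} = m\frac{\partial}{\partial x} + m'\frac{\partial}{\partial y},$$

the transformation being contragredient to (3) (compare Chap. X, §6.2).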


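If now the operators $\dfrac{\partial}{\partial X_1}, \dfrac{\partial}{\partial Y_1}, \dfrac{\partial}{\partial X_2}, \dfrac{\partial}{\partial Y_2}$ operate on a function
of $X_1, Y_1, X_2, Y_2$ and these are taken as independent variables,
we have

$$
\begin{aligned}
\frac{\partial}{\partial X_1}\frac{\partial}{\partial Y_2} - \frac{\partial}{\partial Y_1}\frac{\partial}{\partial X_2}
&= \left(l\frac{\partial}{\partial x_1} + l'\frac{\partial}{\partial y_1}\right)\left(m\frac{\partial}{\partial x_2} + m'\frac{\partial}{\partial y_2}\right)
 - \left(m\frac{\partial}{\partial x_1} + m'\frac{\partial}{\partial y_1}\right)\left(l\frac{\partial}{\partial x_2} + l'\frac{\partial}{\partial y_2}\right)\\
&= (lm' - l'm)\left(\frac{\partial}{\partial x_1}\frac{\partial}{\partial y_2} - \frac{\partial}{\partial y_1}\frac{\partial}{\partial x_2}\right).
\end{aligned}
\tag{4}
$$

Hence (4) is an invariant operator.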

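Let $u_1, v_1$ be binary forms $u, v$ when we put $x = x_1$, $y = y_1$;
$u_2, v_2$ the same forms $u, v$ when we put $x = x_2$, $y = y_2$; $U_1, V_1$
and $U_2, V_2$ the corresponding forms when $u, v$ are expressed in
terms of $X, Y$ and the particular values $X_1, Y_1$ and $X_2, Y_2$ are
introduced. Then, operating on the product $U_1V_2$, we have

$$\left(\frac{\partial}{\partial X_1}\frac{\partial}{\partial Y_2} - \frac{\partial}{\partial Y_1}\frac{\partial}{\partial X_2}\right)U_1V_2 = |M|\left(\frac{\partial}{\partial x_1}\frac{\partial}{\partial y_2} - \frac{\partial}{\partial y_1}\frac{\partial}{\partial x_2}\right)u_1v_2,$$

and so on. That is, for any integer $r$,

$$
\begin{aligned}
&\frac{\partial^r U_1}{\partial X_1^r}\frac{\partial^r V_2}{\partial Y_2^r} - r\,\frac{\partial^r U_1}{\partial X_1^{r-1}\partial Y_1}\frac{\partial^r V_2}{\partial X_2\,\partial Y_2^{r-1}} + \ldots\\
&\qquad = |M|^r\left(\frac{\partial^r u_1}{\partial x_1^r}\frac{\partial^r v_2}{\partial y_2^r} - r\,\frac{\partial^r u_1}{\partial x_1^{r-1}\partial y_1}\frac{\partial^r v_2}{\partial x_2\,\partial y_2^{r-1}} + \ldots\right).
\end{aligned}
$$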

These results are true for all pairs (x v y x ) and (x 2 ,y 2 ). We 
may, then, replace both pairs by (x, y) and so obtain the 


theorem that 


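$$\frac{\partial^r u}{\partial x^r}\frac{\partial^r v}{\partial y^r} - r\,\frac{\partial^r u}{\partial x^{r-1}\partial y}\frac{\partial^r v}{\partial x\,\partial y^{r-1}} + \ldots \tag{5}$$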


is a covariant (possibly an invariant) of the two forms u and v. 
It is called the rth transvectant of u and v. 

Finally, when we take $v = u$ in (5), we obtain covariants of
the single form $u$.
This method can be made the basis of a systematic attack
on the invariants and covariants of a given set of forms.
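
As a small worked instance (an illustration added here, with the quadratic chosen for convenience), take $r = 2$ and $v = u = ax^2 + 2hxy + by^2$. The second transvectant, written out in full with its binomial coefficients, is

```latex
\[
\frac{\partial^2 u}{\partial x^2}\frac{\partial^2 u}{\partial y^2}
 - 2\,\frac{\partial^2 u}{\partial x\,\partial y}\frac{\partial^2 u}{\partial x\,\partial y}
 + \frac{\partial^2 u}{\partial y^2}\frac{\partial^2 u}{\partial x^2}
 = (2a)(2b) - 2(2h)^2 + (2b)(2a)
 = 8(ab - h^2).
\]
```

This is a numerical multiple of the discriminant $ab - h^2$ of the quadratic, so that in this case the transvectant is an invariant rather than a covariant.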

7. Further reading 

The present chapter has done no more than sketch some of 
the elementary results from which the theory of invariants and 
covariants starts. The following books deal more fully with
the subject:

E. B. Elliott, An Introduction to the Algebra of Quantics (Oxford, 1895).
Grace and Young, The Algebra of Invariants (Cambridge, 1902).
Weitzenböck, Invariantentheorie (Groningen, 1923).
H. W. Turnbull, The Theory of Determinants, Matrices, and Invariants
(Glasgow, 1928).


Examples XIV c 

1. Write down the most general function of the coefficients in
$a_0x^3 + 3a_1x^2y + 3a_2xy^2 + a_3y^3$ which is (i) homogeneous and of degree 2
in the coefficients, isobaric, and of weight 3; (ii) homogeneous and of
degree 3 in the coefficients, isobaric, and of weight 4.

Hint. (i) Three is the sum of 3 and 0 or of 2 and 1; the only terms
possible are numerical multiples of $a_0a_3$ and $a_1a_2$.

Ans. $\alpha a_0a_3 + \beta a_1a_2$, where $\alpha$, $\beta$ are numerical constants.

2. What is the weight of the invariant

$I \equiv ae - 4bd + 3c^2$ of the quartic $(a,b,c,d,e)(x,y)^4$?

Hint. Rewrite the quartic as $(a_0, \ldots, a_4)(x,y)^4$.

3. What is the weight of the discriminant $|a_{rs}|$ of the quadratic form

$$a_{11}x_1^2 + a_{22}x_2^2 + a_{33}x_3^2 + 2a_{12}x_1x_2 + 2a_{23}x_2x_3 + 2a_{31}x_3x_1\,?$$

Hint. Rewrite the form as

$$\sum \frac{2!}{\alpha!\,\beta!\,\gamma!}\, a_{\alpha,\beta,\gamma}\, x_1^{\alpha}x_2^{\beta}x_3^{\gamma},$$

summed for $\alpha+\beta+\gamma = 2$. Thus $a_{11} = a_{2,0,0}$, $a_{23} = a_{0,1,1}$.

4. Write down the Jacobian of $ax^2 + 2hxy + by^2$, $a'x^2 + 2h'xy + b'y^2$ and
deduce from it (§ 3.6) that

$$(ab' - a'b)^2 + 4(ah' - a'h)(bh' - b'h)$$

is a joint-invariant of the two forms.

5. Verify the theorem 'When $I$ is an invariant of the form
$(a_0, \ldots, a_n)(x,y)^n$,

$$\left(a_0\frac{\partial}{\partial a_1} + 2a_1\frac{\partial}{\partial a_2} + \ldots + na_{n-1}\frac{\partial}{\partial a_n}\right)I = 0,$$

$$\left(na_1\frac{\partial}{\partial a_0} + (n-1)a_2\frac{\partial}{\partial a_1} + \ldots + a_n\frac{\partial}{\partial a_{n-1}}\right)I = 0,'$$

in so far as it relates to (i) the discriminant of a quadratic form, (ii) the
invariant $\Delta$ (§ 4.3) of a cubic, (iii) the invariants $I$, $J$ (§ 5.1) of a quartic.

6. Show that the general cubic $ax^3 + 3bx^2y + 3cxy^2 + dy^3$ can be reduced
to the canonical form $AX^3 + DY^3$ by the substitution $X = px + qy$,
$Y = p'x + q'y$, where the Hessian of the cubic is

$$(ac - b^2)x^2 + (ad - bc)xy + (bd - c^2)y^2 \equiv (px + qy)(p'x + q'y).$$

Hint. The Hessian of $AX^3 + 3BX^2Y + 3CXY^2 + DY^3$ reduces to
$XY$: i.e. $AC = B^2$, $BD = C^2$. Prove that, if $A \neq 0$, either $B = C = 0$
or the form is a cube; if $A = 0$, then $B = C = 0$ and the Hessian is
identically zero.

7. Find the equation of the double lines of the involution determined
by the two line-pairs

$$ax^2 + 2hxy + by^2 = 0, \qquad a'x^2 + 2h'xy + b'y^2 = 0$$

and prove that it corresponds to a covariant of the two forms.

Hint. If the line-pair is $a_1x^2 + 2h_1xy + b_1y^2 = 0$, then, by the usual
condition for harmonic conjugates,

$$ab_1 + a_1b - 2hh_1 = 0, \qquad a'b_1 + a_1b' - 2h'h_1 = 0.$$

8. If two conics $S$, $S'$ are such that a triangle can be inscribed in $S'$
and circumscribed to $S$, then the invariants (Example 11, p. 180)
$\Delta, \Theta, \Theta', \Delta'$ satisfy the relation $\Theta^2 = 4\Delta\Theta'$ independently of the choice of
triangle of reference.

Hint. Consider the equations of the conics in the forms

$$\Sigma \equiv 2Fmn + 2Gnl + 2Hlm = 0,$$

$$S' \equiv 2f'yz + 2g'zx + 2h'xy = 0.$$

The general result follows by linear transformation.

9. Prove that the rank of the matrix of the coefficients $a_{rs}$ of $m$ linear
forms

$$a_{r1}x_1 + \ldots + a_{rn}x_n \qquad (r = 1, \ldots, m)$$

in $n$ variables is an arithmetic invariant.

10. Prove that if $(x_r, y_r, z_r)$ are transformed into $(X_r, Y_r, Z_r)$ by x = MX,
then the determinant $|x_1\ y_2\ z_3|$ is an invariant.

11. Prove that the $n$th transvectant of two forms

$$(a_0, \ldots, a_n)(x,y)^n, \qquad (b_0, \ldots, b_n)(x,y)^n$$

is linear in the coefficients of each form and is a joint-invariant of the two
forms. It is called the lineo-linear invariant.

Ans. $a_0b_n - na_1b_{n-1} + \tfrac{1}{2}n(n-1)a_2b_{n-2} + \ldots$ .

12. Prove, by Theorems 34 and 37, that the rank of a quadratic form
is unaltered by any non-singular linear transformation of the variables.
1. Introduction

1.1. Vectors in three dimensions. In three-dimensional
geometry, with rectangular axes $O\xi_1$, $O\xi_2$, and $O\xi_3$, a vector
$\boldsymbol{\xi} = OP$ has components

$$(\xi_1, \xi_2, \xi_3),$$

these being the coordinates of $P$ with respect to the axes. The
length of the vector $\boldsymbol{\xi}$ is

$$|\boldsymbol{\xi}| = \sqrt{(\xi_1^2 + \xi_2^2 + \xi_3^2)} \tag{1}$$

and the direction-cosines of the vector are

$$\frac{\xi_1}{|\boldsymbol{\xi}|}, \quad \frac{\xi_2}{|\boldsymbol{\xi}|}, \quad \frac{\xi_3}{|\boldsymbol{\xi}|}.$$

Two vectors, $\boldsymbol{\xi}$ with components $(\xi_1, \xi_2, \xi_3)$ and $\boldsymbol{\eta}$ with
components $(\eta_1, \eta_2, \eta_3)$, are orthogonal if

$$\xi_1\eta_1 + \xi_2\eta_2 + \xi_3\eta_3 = 0. \tag{2}$$

Finally, if $\mathbf{e}_1$, $\mathbf{e}_2$, and $\mathbf{e}_3$ have components

(1,0,0), (0,1,0), and (0,0,1),

a vector $\boldsymbol{\xi}$ with components $(\xi_1, \xi_2, \xi_3)$ may be written as

$$\boldsymbol{\xi} = \xi_1\mathbf{e}_1 + \xi_2\mathbf{e}_2 + \xi_3\mathbf{e}_3.$$

Also, each of the vectors $\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3$ is of unit length, by (1), and the
three vectors are, by (2), mutually orthogonal. They are, of
course, unit vectors along the axes.

1.2. Single-column matrices. A purely algebraical pre- 
sentation of these details is available to us if we use a single- 
column matrix to represent a vector. 

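A vector $OP$ is represented by a single-column matrix $\boldsymbol{\xi}$ whose
elements are $\xi_1, \xi_2, \xi_3$. The length of the vector is defined to be

$$\sqrt{(\xi_1^2 + \xi_2^2 + \xi_3^2)}. \tag{1}$$

The condition for two vectors to be orthogonal now takes a purely
matrix form. The transpose $\boldsymbol{\xi}'$ of $\boldsymbol{\xi}$ is a single-row matrix with
elements $\xi_1, \xi_2, \xi_3$ and

$$\boldsymbol{\xi}'\boldsymbol{\eta} = [\xi_1 \ \ \xi_2 \ \ \xi_3] \times \begin{bmatrix} \eta_1 \\ \eta_2 \\ \eta_3 \end{bmatrix}$$

is a single-element matrix

$$[\xi_1\eta_1 + \xi_2\eta_2 + \xi_3\eta_3].$$

The condition for $\boldsymbol{\xi}$ and $\boldsymbol{\eta}$ to be orthogonal is, in matrix notation,

$$\boldsymbol{\xi}'\boldsymbol{\eta} = 0. \tag{2}$$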

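The matrices $\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3$ are defined by

$$\mathbf{e}_1 = \begin{bmatrix} 1 \\ 0 \\ 0 \end{bmatrix}, \qquad \mathbf{e}_2 = \begin{bmatrix} 0 \\ 1 \\ 0 \end{bmatrix}, \qquad \mathbf{e}_3 = \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix};$$

they are mutually orthogonal and

$$\boldsymbol{\xi} = \xi_1\mathbf{e}_1 + \xi_2\mathbf{e}_2 + \xi_3\mathbf{e}_3.$$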


The change from three dimensions to $n$ dimensions is immediate.
With $n$ variables, a vector $\boldsymbol{\xi}$ is a single-column matrix
with elements $\xi_1, \xi_2, \ldots, \xi_n$.

The length of the vector is defined to be

$$|\boldsymbol{\xi}| = \sqrt{(\xi_1^2 + \xi_2^2 + \ldots + \xi_n^2)}, \tag{3}$$

and when $|\boldsymbol{\xi}| = 1$ the vector is said to be a unit vector.

Two vectors $\boldsymbol{\xi}$ and $\boldsymbol{\eta}$ are said to be orthogonal when

$$\xi_1\eta_1 + \xi_2\eta_2 + \ldots + \xi_n\eta_n = 0, \tag{4}$$

which, in matrix notation, may be expressed in either of the two
forms†

$$\boldsymbol{\xi}'\boldsymbol{\eta} = 0, \qquad \boldsymbol{\eta}'\boldsymbol{\xi} = 0. \tag{5}$$

In the same notation, when $\boldsymbol{\xi}$ is a unit vector

$$\boldsymbol{\xi}'\boldsymbol{\xi} = 1.$$

Here, and later, 1 denotes the unit single-element matrix.

Finally, we define the unit vector $\mathbf{e}_r$ as the single-column
matrix which has unity in the $r$th row and zero elsewhere. With
this notation the $n$ vectors $\mathbf{e}_r$ $(r = 1, \ldots, n)$ are mutually orthogonal
and

$$\boldsymbol{\xi} = \xi_1\mathbf{e}_1 + \xi_2\mathbf{e}_2 + \ldots + \xi_n\mathbf{e}_n.$$

† Note the order of multiplication; we can form the product AB only when
the number of columns of A is equal to the number of rows of B (cf. p. 73) and
hence we cannot form the products $\boldsymbol{\xi}\boldsymbol{\eta}$ or $\boldsymbol{\eta}\boldsymbol{\xi}$.
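
These definitions translate directly into a few lines of code. The sketch below is illustrative only: vectors are held as plain Python lists, the names `dot`, `length`, and `unit` are hypothetical helpers, and the sample vectors are arbitrary.

```python
import math

def dot(xi, eta):
    """Numerical value of the single-element matrix xi' eta."""
    return sum(a * b for a, b in zip(xi, eta))

def length(xi):
    """Length of a vector as defined in (3)."""
    return math.sqrt(dot(xi, xi))

def unit(r, n):
    """The vector e_r: unity in the r-th row (counting from 1), zero elsewhere."""
    return [1 if i == r else 0 for i in range(1, n + 1)]

xi = [3, 0, 4, 0]
eta = [0, 2, 0, -5]

print(length(xi))      # length of xi
print(dot(xi, eta))    # zero exactly when xi and eta are orthogonal

# Rebuild xi as xi_1 e_1 + ... + xi_n e_n.
n = len(xi)
rebuilt = [sum(xi[r - 1] * unit(r, n)[i] for r in range(1, n + 1)) for i in range(n)]
print(rebuilt)
```

Here `dot(xi, eta)` and `dot(eta, xi)` return the same number, which is the content of the two equivalent forms in (5).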


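1.3. Orthogonal matrices. In this subsection we use $\mathbf{x}_r$
to denote a vector, or single-column matrix, with elements

$$x_{1r}, x_{2r}, \ldots, x_{nr}.$$

Theorem 59. Let $\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_n$ be mutually orthogonal unit
vectors. Then the square matrix X whose columns are the n vectors
$\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_n$ is an orthogonal matrix.

Proof. Let $X'$ be the transpose of $X$. Then the product $X'X$ is

$$
\begin{bmatrix}
x_{11} & x_{21} & \cdots & x_{n1} \\
\vdots & \vdots & & \vdots \\
x_{1r} & x_{2r} & \cdots & x_{nr} \\
\vdots & \vdots & & \vdots \\
x_{1n} & x_{2n} & \cdots & x_{nn}
\end{bmatrix}
\times
\begin{bmatrix}
x_{11} & \cdots & x_{1s} & \cdots & x_{1n} \\
x_{21} & \cdots & x_{2s} & \cdots & x_{2n} \\
\vdots & & \vdots & & \vdots \\
x_{n1} & \cdots & x_{ns} & \cdots & x_{nn}
\end{bmatrix}.
$$

The element in the $r$th row and $s$th column of the product is, on
using the summation convention,

$$x_{\alpha r}x_{\alpha s}.$$

When $s = r$ this is unity, since $\mathbf{x}_r$ is a unit vector; and when
$s \neq r$ this is zero, since the vectors $\mathbf{x}_r$ and $\mathbf{x}_s$ are orthogonal.
Hence

$$X'X = I,$$

the unit matrix of order $n$, and so $X$ is an orthogonal matrix.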

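Aliter. (The same proof in different words.)

$X$ is a matrix whose columns are

$$\mathbf{x}_1, \ldots, \mathbf{x}_s, \ldots, \mathbf{x}_n,$$

while $X'$ is a matrix whose rows are

$$\mathbf{x}'_1, \ldots, \mathbf{x}'_r, \ldots, \mathbf{x}'_n.$$

The element in the $r$th row and $s$th column of the product $X'X$
is $\mathbf{x}'_r\mathbf{x}_s$ (strictly the numerical value in this single-element
matrix).

Let $r \neq s$; then $\mathbf{x}'_r\mathbf{x}_s = 0$, since $\mathbf{x}_r$ and $\mathbf{x}_s$ are orthogonal.
Let $r = s$; then $\mathbf{x}'_r\mathbf{x}_r = \mathbf{x}'_s\mathbf{x}_s = 1$, since $\mathbf{x}_s$ is a unit vector.
Hence $X'X = I$ and $X$ is an orthogonal matrix.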
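Corollary. In the transformation $\boldsymbol{\eta} = X'\boldsymbol{\xi}$, or $\boldsymbol{\xi} = X\boldsymbol{\eta}$
$[X' = X^{-1}]$, let $\boldsymbol{\eta} = \mathbf{y}_r$ when $\boldsymbol{\xi} = \mathbf{x}_r$. Then $\mathbf{y}_r = \mathbf{e}_r$.

Proof. Let $Y$ be the square matrix whose columns are $\mathbf{y}_1, \ldots, \mathbf{y}_n$.
Then, since $\mathbf{y}_r = X'\mathbf{x}_r$ and the columns of $X$ are $\mathbf{x}_1, \ldots, \mathbf{x}_n$,

$$Y = X'X = I.$$

Since the columns of $I$ are $\mathbf{e}_1, \ldots, \mathbf{e}_n$, the result is established.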

1.4. Linear dependence. As in § 1.3, $\mathbf{x}_r$ denotes a vector
with elements

$$x_{1r}, x_{2r}, \ldots, x_{nr}.$$

The vectors $\mathbf{x}_1, \ldots, \mathbf{x}_m$ are linearly dependent if there are numbers
$l_1, \ldots, l_m$, not all zero, for which

$$l_1\mathbf{x}_1 + \ldots + l_m\mathbf{x}_m = 0. \tag{1}$$

If (1) is true only when $l_1, \ldots, l_m$ are all zero, the vectors are linearly
independent.

Lemma 1 . Of any set of vectors at most n are linearly independent. 

Proof. Let there be given $n+k$ vectors. Let A be a matrix
having these vectors as columns. The rank r of A cannot exceed 
n and, by Theorem 31 (with columns for rows) we can select r 
columns of A and express the others as sums of multiples of the 
selected r columns. 


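Aliter. Let $\mathbf{x}_1, \ldots, \mathbf{x}_n$ be any given set of linearly independent
vectors; say

$$\mathbf{x}_s = \sum_{r=1}^{n} x_{rs}\mathbf{e}_r \qquad (s = 1, \ldots, n). \tag{2}$$

The determinant $|X| \equiv |x_{rs}| \neq 0$, since the given vectors are
not linearly dependent. Let $X_{rs}$ be the cofactor of $x_{rs}$ in $X$; then,
from (2),

$$\sum_{s=1}^{n} X_{rs}\mathbf{x}_s = |X|\,\mathbf{e}_r \qquad (r = 1, \ldots, n). \tag{3}$$

Any vector $\mathbf{x}_p$ $(p > n)$ must be of the form

$$\mathbf{x}_p = \sum_{r=1}^{n} x_{rp}\mathbf{e}_r,$$

and therefore, by (3), must be a sum of multiples of the vectors
$\mathbf{x}_1, \ldots, \mathbf{x}_n$.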
Lemma 2. Any n mutually orthogonal unit vectors are linearly
independent.

Proof. Let $\mathbf{x}_1, \ldots, \mathbf{x}_n$ be mutually orthogonal unit vectors.
Suppose there are numerical multiples $k_r$ for which

$$k_1\mathbf{x}_1 + \ldots + k_n\mathbf{x}_n = 0. \tag{4}$$

Pre-multiply by $\mathbf{x}'_s$, where $s$ is any one of $1, \ldots, n$. Then

$$k_s\mathbf{x}'_s\mathbf{x}_s = 0,$$

and hence, since $\mathbf{x}'_s\mathbf{x}_s = 1$, $k_s = 0$.

Hence (4) is true only when

$$k_1 = \ldots = k_n = 0.$$

Aliter. By Theorem 59, $X'X = I$ and hence the determinant
$|X| = \pm 1$. The columns $\mathbf{x}_1, \ldots, \mathbf{x}_n$ of $X$ are therefore linearly
independent.

2. Latent vectors 

2.1. Definition. We begin by proving a result indicated,
but not worked to its logical conclusion, in an earlier chapter.†

Theorem 60. Let A be a given square matrix of n rows and
columns and $\lambda$ a numerical constant. The matrix equation

$$A\mathbf{x} = \lambda\mathbf{x}, \tag{1}$$

in which $\mathbf{x}$ is a single-column matrix of n elements, has a solution
with at least one element of $\mathbf{x}$ not zero if and only if $\lambda$ is a root of the
equation

$$|A - \lambda I| = 0. \tag{2}$$

Proof. If the elements of $\mathbf{x}$ are $\xi_1, \ldots, \xi_n$, the matrix equation
(1) is equivalent to the $n$ equations

$$\sum_{j=1}^{n} a_{ij}\xi_j = \lambda\xi_i \qquad (i = 1, \ldots, n).$$

These linear equations in the $n$ variables $\xi_1, \ldots, \xi_n$ have a non-zero
solution if and only if $\lambda$ is a root of the equation [Theorem 11]

$$
\begin{vmatrix}
a_{11}-\lambda & a_{12} & \cdots & a_{1n} \\
a_{21} & a_{22}-\lambda & \cdots & a_{2n} \\
\vdots & \vdots & & \vdots \\
a_{n1} & a_{n2} & \cdots & a_{nn}-\lambda
\end{vmatrix} = 0, \tag{3}
$$

and (3) is merely (2) written in full.

† Chapter X, § 7.

Definition. The roots in $\lambda$ of $|A - \lambda I| = 0$ are the latent roots of
the matrix A; when $\lambda$ is a latent root, a non-zero vector $\mathbf{x}$ satisfying

$$A\mathbf{x} = \lambda\mathbf{x}$$

is a latent vector of A corresponding to the root $\lambda$.
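
For a matrix of order 2 the characteristic equation is a quadratic, and the definition can be tried out directly. The following sketch assumes a 2-by-2 matrix with real latent roots; the function names and the sample matrix are illustrative choices, not taken from the text.

```python
import math

def latent_roots_2x2(A):
    """Roots of |A - lambda I| = 0 for a 2x2 matrix A, assumed to have real roots."""
    (a, b), (c, d) = A
    trace, det = a + d, a * d - b * c
    disc = trace * trace - 4 * det        # discriminant of lambda^2 - trace*lambda + det
    if disc < 0:
        raise ValueError("latent roots are complex")
    root = math.sqrt(disc)
    return (trace - root) / 2, (trace + root) / 2

def latent_vector_2x2(A, lam):
    """A non-zero x with Ax = lam*x, assuming lam is a latent root of A."""
    (a, b), (c, d) = A
    # Each candidate is annihilated by A - lam*I when lam is a latent root.
    for x in ([b, lam - a], [lam - d, c]):
        if any(abs(v) > 1e-12 for v in x):
            return x
    return [1.0, 0.0]    # A - lam*I is the zero matrix: every vector is latent

def apply(A, x):
    return [sum(A[i][j] * x[j] for j in range(2)) for i in range(2)]

A = [[2, 1], [1, 2]]
roots = latent_roots_2x2(A)
for lam in roots:
    x = latent_vector_2x2(A, lam)
    print(lam, x, apply(A, x))    # apply(A, x) should equal lam times x
```

For larger matrices the characteristic equation is rarely solved in this explicit way; the sketch is meant only to make the definition concrete.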

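2.2. Theorem 61. Let $\mathbf{x}_r$ and $\mathbf{x}_s$ be latent vectors that correspond
to two distinct latent roots $\lambda_r$ and $\lambda_s$ of a matrix A. Then

(i) $\mathbf{x}_r$ and $\mathbf{x}_s$ are always linearly independent, and

(ii) when A is symmetrical, $\mathbf{x}_r$ and $\mathbf{x}_s$ are orthogonal.

Proof. By hypothesis,

$$A\mathbf{x}_r = \lambda_r\mathbf{x}_r, \qquad A\mathbf{x}_s = \lambda_s\mathbf{x}_s. \tag{1}$$

(i) Let $k_r$, $k_s$ be numbers for which

$$k_r\mathbf{x}_r + k_s\mathbf{x}_s = 0. \tag{2}$$

Then, since $A\mathbf{x}_s = \lambda_s\mathbf{x}_s$, (2) gives

$$Ak_r\mathbf{x}_r = -\lambda_s(k_s\mathbf{x}_s) = \lambda_s(k_r\mathbf{x}_r),$$

and so, by (1), $k_r\lambda_r\mathbf{x}_r = k_r\lambda_s\mathbf{x}_r$.

That is to say, when (2) holds,

$$k_r(\lambda_r - \lambda_s)\mathbf{x}_r = 0.$$

By hypothesis $\mathbf{x}_r$ is a non-zero vector and $\lambda_r \neq \lambda_s$. Hence
$k_r = 0$ and (2) reduces to $k_s\mathbf{x}_s = 0$. But, by hypothesis, $\mathbf{x}_s$ is a
non-zero vector and therefore $k_s = 0$.

Hence (2) is true only if $k_r = k_s = 0$; accordingly, $\mathbf{x}_r$ and $\mathbf{x}_s$
are linearly independent vectors.

(ii) Let $A' = A$. Then, by the first equation in (1),

$$\mathbf{x}'_sA\mathbf{x}_r = \mathbf{x}'_s\lambda_r\mathbf{x}_r. \tag{3}$$

The transpose of the second equation in (1) gives

$$\mathbf{x}'_sA' = \lambda_s\mathbf{x}'_s,$$

and from this, since $A' = A$,

$$\mathbf{x}'_sA\mathbf{x}_r = \lambda_s\mathbf{x}'_s\mathbf{x}_r. \tag{4}$$

From (3) and (4), $(\lambda_r - \lambda_s)\mathbf{x}'_s\mathbf{x}_r = 0$,
and so, since the numerical multiplier $\lambda_r - \lambda_s \neq 0$, $\mathbf{x}'_s\mathbf{x}_r = 0$.
Hence $\mathbf{x}_r$ and $\mathbf{x}_s$ are orthogonal vectors.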





3. Application to quadratic forms 

We have already seen that (cf. Theorem 49, p. 153):

A real quadratic form $\boldsymbol{\xi}'A\boldsymbol{\xi}$ in the n variables $\xi_1, \ldots, \xi_n$ can be
reduced by a real orthogonal transformation

$$\boldsymbol{\xi} = X\boldsymbol{\eta}$$

to the form $\lambda_1\eta_1^2 + \lambda_2\eta_2^2 + \ldots + \lambda_n\eta_n^2$,

wherein $\lambda_1, \ldots, \lambda_n$ are the n roots of $|A - \lambda I| = 0$.

We now show that the latent vectors of A provide the columns
of the transforming matrix X.

3.1. When A has n distinct latent roots. 

Theorem 62. Let A be a real symmetrical matrix having n
distinct latent roots $\lambda_1, \ldots, \lambda_n$. Then there are n distinct real unit
latent vectors $\mathbf{x}_1, \ldots, \mathbf{x}_n$ corresponding to these roots. If X is the
square matrix whose columns are $\mathbf{x}_1, \ldots, \mathbf{x}_n$, the transformation

$$\boldsymbol{\xi} = X\boldsymbol{\eta}$$

from variables $\xi_1, \ldots, \xi_n$ to variables $\eta_1, \ldots, \eta_n$ is a real orthogonal
transformation and

$$\boldsymbol{\xi}'A\boldsymbol{\xi} = \lambda_1\eta_1^2 + \lambda_2\eta_2^2 + \ldots + \lambda_n\eta_n^2.$$

Proof. The roots $\lambda_1, \ldots, \lambda_n$ are necessarily real (Theorem 45)
and so the elements of a latent vector $\mathbf{x}_r$ satisfying

$$A\mathbf{x}_r = \lambda_r\mathbf{x}_r \tag{1}$$

can be found in terms of real numbers† and, if the length of any
one such vector is $k$, the vector $k^{-1}\mathbf{x}_r$ is a real unit vector satisfying
(1). Hence there are $n$ real unit latent vectors $\mathbf{x}_1, \ldots, \mathbf{x}_n$, and
the matrix $X$ having these vectors in its $n$ columns has real
numbers as its elements.

Again, by Theorem 61, $\mathbf{x}_r$ is orthogonal to $\mathbf{x}_s$ when $r \neq s$ and,
since each is a unit vector,

$$\mathbf{x}'_r\mathbf{x}_s = \delta_{rs}, \tag{2}$$

where $\delta_{rs} = 0$ when $r \neq s$ and $\delta_{rr} = 1$ $(r = 1, \ldots, n)$. As in the
proof of Theorem 59, the element in the $r$th row and $s$th column
of the product $X'X$ is $\delta_{rs}$; whence

$$X'X = I$$

and $X$ is an orthogonal matrix.

† When $\mathbf{x}_r$ is a solution and $\alpha$ is any number, real or complex, $\alpha\mathbf{x}_r$ is also a
solution. For our present purposes, we leave aside all complex values of $\alpha$.

The transformation $\boldsymbol{\xi} = X\boldsymbol{\eta}$ gives

$$\boldsymbol{\xi}'A\boldsymbol{\xi} = \boldsymbol{\eta}'X'AX\boldsymbol{\eta} \tag{3}$$

and the discriminant of the form in the variables $\eta_1, \ldots, \eta_n$ is the
matrix $X'AX$. Now the columns of $AX$ are $A\mathbf{x}_1, \ldots, A\mathbf{x}_n$, and
these, from (1), are $\lambda_1\mathbf{x}_1, \ldots, \lambda_n\mathbf{x}_n$. Hence

(i) the rows of $X'$ are $\mathbf{x}'_1, \ldots, \mathbf{x}'_n$,

(ii) the columns of $AX$ are $\lambda_1\mathbf{x}_1, \ldots, \lambda_n\mathbf{x}_n$ and the element in
the $r$th row and $s$th column of $X'AX$ is the numerical value of

$$\mathbf{x}'_r\lambda_s\mathbf{x}_s = \lambda_s\mathbf{x}'_r\mathbf{x}_s = \lambda_s\delta_{rs},$$

by (2). Thus $X'AX$ has $\lambda_1, \ldots, \lambda_n$ as its elements in the principal
diagonal and zero elsewhere. The form is therefore

$$\lambda_1\eta_1^2 + \lambda_2\eta_2^2 + \ldots + \lambda_n\eta_n^2.$$
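
The reduction can be followed numerically on a small case. In the sketch below the symmetric matrix, its latent roots 3 and 1, and the unit latent vectors are assumptions chosen for illustration; the code merely forms X'X and X'AX and prints them.

```python
import math

def matmul(P, Q):
    """Product of two matrices given as lists of rows."""
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

def transpose(P):
    return [list(col) for col in zip(*P)]

A = [[2, 1], [1, 2]]          # real symmetric; latent roots 3 and 1
s = 1 / math.sqrt(2)
x1 = [s, s]                   # unit latent vector for the root 3
x2 = [s, -s]                  # unit latent vector for the root 1
X = transpose([x1, x2])       # the columns of X are x1 and x2

XtX = matmul(transpose(X), X)                 # should be the unit matrix
XtAX = matmul(transpose(X), matmul(A, X))     # should be diag(3, 1)
print(XtX)
print(XtAX)
```

Up to rounding error, `XtX` is the unit matrix and `XtAX` is the diagonal matrix of the latent roots, so the form $2\xi_1^2 + 2\xi_1\xi_2 + 2\xi_2^2$ becomes $3\eta_1^2 + \eta_2^2$.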

3.2. When A has repeated latent roots. It is in fact true
that, whether $|A - \lambda I| = 0$ has repeated roots or has all its
roots distinct, there are always n mutually orthogonal real unit
latent vectors $\mathbf{x}_1, \ldots, \mathbf{x}_n$ of the symmetrical matrix A and, X being
the matrix with these vectors as columns, the transformation $\boldsymbol{\xi} = X\boldsymbol{\eta}$
is a real orthogonal transformation that gives

$$\boldsymbol{\xi}'A\boldsymbol{\xi} = \lambda_1\eta_1^2 + \ldots + \lambda_n\eta_n^2,$$

wherein $\lambda_1, \ldots, \lambda_n$ are the n roots (some of them possibly equal) of the
characteristic equation $|A - \lambda I| = 0$.

The proof will not be given here. The fundamental difficulty
is to prove† that, when $\lambda_1$ (say) is a $k$-ple root of $|A - \lambda I| = 0$,
there are $k$ linearly independent latent vectors corresponding
to $\lambda_1$. The setting up of a system of $n$ mutually orthogonal unit
vectors is then effected by Schmidt's orthogonalization process†
or by the equivalent process illustrated below.

† Quart. J. Math. (Oxford) 18 (1947) 183-5 gives a proof by J. A. Todd;
another treatment is given, ibid., pp. 186-92, by W. L. Ferrar. Numerical
examples are easily dealt with by actually finding the vectors; see Examples
XV, 11 and 14.

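Let $\mathbf{y}_1, \mathbf{y}_2, \mathbf{y}_3$ be three given real linearly independent vectors.
Choose $k_1$ so that $k_1\mathbf{y}_1$ is a unit vector and let $\mathbf{x}_1 = k_1\mathbf{y}_1$. Choose
the constant $\alpha$ so that

$$\mathbf{x}'_1(\mathbf{y}_2 + \alpha\mathbf{x}_1) = 0; \tag{1}$$

that is, since $\mathbf{x}'_1\mathbf{x}_1 = 1$, $\alpha = -\mathbf{x}'_1\mathbf{y}_2$.‡

Since $\mathbf{y}_2$ and $\mathbf{x}_1$ are linearly independent, the vector $\mathbf{y}_2 + \alpha\mathbf{x}_1$
is not zero and has a non-zero length, $l_2$ say. Put

$$\mathbf{x}_2 = l_2^{-1}(\mathbf{y}_2 + \alpha\mathbf{x}_1).$$

Then $\mathbf{x}_2$ is a unit vector and, by (1), it is orthogonal to $\mathbf{x}_1$.

Now determine $\beta$ and $\gamma$ so that

$$\mathbf{x}'_1(\mathbf{y}_3 + \beta\mathbf{x}_1 + \gamma\mathbf{x}_2) = 0 \tag{2}$$

and

$$\mathbf{x}'_2(\mathbf{y}_3 + \beta\mathbf{x}_1 + \gamma\mathbf{x}_2) = 0. \tag{3}$$

That is, since $\mathbf{x}_1$ and $\mathbf{x}_2$ are orthogonal,

$$\beta = -\mathbf{x}'_1\mathbf{y}_3, \qquad \gamma = -\mathbf{x}'_2\mathbf{y}_3.$$

The vector $\mathbf{y}_3 + \beta\mathbf{x}_1 + \gamma\mathbf{x}_2 \neq 0$, since the vectors $\mathbf{y}_1, \mathbf{y}_2, \mathbf{y}_3$ are
linearly independent; it has a non-zero length $l_3$ and with

$$\mathbf{x}_3 = l_3^{-1}(\mathbf{y}_3 + \beta\mathbf{x}_1 + \gamma\mathbf{x}_2)$$

we have a unit vector which, by (2) and (3), is orthogonal to $\mathbf{x}_1$
and $\mathbf{x}_2$. Thus the three $\mathbf{x}$ vectors are mutually orthogonal unit
vectors.

Moreover, if $\mathbf{y}_1, \mathbf{y}_2, \mathbf{y}_3$ are latent vectors corresponding to the
latent root $\lambda$ of a matrix $A$, so that

$$A\mathbf{y}_1 = \lambda\mathbf{y}_1, \qquad A\mathbf{y}_2 = \lambda\mathbf{y}_2, \qquad A\mathbf{y}_3 = \lambda\mathbf{y}_3,$$

then also

$$A\mathbf{x}_1 = \lambda\mathbf{x}_1, \qquad A\mathbf{x}_2 = \lambda\mathbf{x}_2, \qquad A\mathbf{x}_3 = \lambda\mathbf{x}_3$$

and $\mathbf{x}_1, \mathbf{x}_2, \mathbf{x}_3$ are mutually orthogonal unit latent vectors
corresponding to the root $\lambda$.

† W. L. Ferrar, Finite Matrices, Theorem 29, p. 139.

‡ More precisely, $\alpha$ is the numerical value of the element in the single-entry
matrix $-\mathbf{x}'_1\mathbf{y}_2$. Both $\mathbf{x}'_1\mathbf{x}_1$ and $\mathbf{x}'_1\mathbf{y}_2$ are single-entry matrices.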
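
The process just described can be sketched in code for any number of given vectors. This is a minimal illustration, assuming real vectors held as Python lists; the function name `orthonormalize`, the tolerance, and the sample matrix (the discriminant of 2yz + 2zx + 2xy, with its repeated latent root -1) are choices made for the example.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def orthonormalize(ys):
    """Turn linearly independent vectors y_1, y_2, ... into mutually
    orthogonal unit vectors x_1, x_2, ... by the process in the text."""
    xs = []
    for y in ys:
        w = list(y)
        for x in xs:
            coeff = -dot(x, y)                      # alpha = -x'y, as in (1)
            w = [wi + coeff * xi for wi, xi in zip(w, x)]
        l = math.sqrt(dot(w, w))                    # the non-zero length l_r
        if l < 1e-12:
            raise ValueError("the given vectors are linearly dependent")
        xs.append([wi / l for wi in w])
    return xs

def apply(A, x):
    return [sum(A[i][j] * x[j] for j in range(len(x))) for i in range(len(A))]

A = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
ys = [[1, -1, 0], [1, 0, -1]]          # two latent vectors for the root -1
xs = orthonormalize(ys)
for x in xs:
    print(x, apply(A, x))              # apply(A, x) should equal -x
```

Because each new vector is a sum of multiples of latent vectors belonging to the same root, it remains a latent vector for that root, which is the point of the closing remark above.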
4. Collineation

Let $\boldsymbol{\xi}$ be a single-column matrix with elements $\xi_1, \ldots, \xi_n$ and
so for other letters. Let these elements be the current coordinates
of a point in a system of $n$ homogeneous coordinates and let $A$
be a given square matrix of order $n$. Then the matrix relation

$$\mathbf{y} = A\mathbf{x} \tag{1}$$

expresses a relation between the variable point $\mathbf{x}$ and the variable
point $\mathbf{y}$.

Now let the coordinate system be changed from $\boldsymbol{\xi}$ to $\boldsymbol{\eta}$ by
means of a transformation

$$\boldsymbol{\xi} = T\boldsymbol{\eta},$$

where $T$ is a non-singular square matrix. The new coordinates,
$\mathbf{X}$ and $\mathbf{Y}$, of the points $\mathbf{x}$ and $\mathbf{y}$ are then given by

$$\mathbf{x} = T\mathbf{X}, \qquad \mathbf{y} = T\mathbf{Y}.$$

In the new coordinate system the relation (1) is expressed by

$$T\mathbf{Y} = AT\mathbf{X}$$

or, since $T$ is non-singular, by

$$\mathbf{Y} = T^{-1}AT\mathbf{X}. \tag{2}$$

The effect of replacing $A$ in (1) by a matrix of the type $T^{-1}AT$
amounts to considering the same geometrical relation expressed
in a different coordinate system. The study of such replacements,
$A$ by $T^{-1}AT$, is important in projective geometry. Here
we shall prove only one theorem, the analogue of Theorem 62.
4.1. When the latent roots of A are all distinct.

Theorem 63. Let the square matrix A, of order n, have n distinct
latent roots $\lambda_1, \ldots, \lambda_n$ and let $\mathbf{t}_1, \ldots, \mathbf{t}_n$ be corresponding latent vectors.
Let T be the square matrix having these vectors as columns. Then†

(i) T is non-singular,

(ii) $T^{-1}AT = \operatorname{diag}(\lambda_1, \ldots, \lambda_n)$.

Proof. (i) We prove that $T$ is non-singular by showing that
$\mathbf{t}_1, \ldots, \mathbf{t}_n$ are linearly independent.

† The notation $\operatorname{diag}(\lambda_1, \ldots, \lambda_n)$ indicates a matrix whose elements are
$\lambda_1, \ldots, \lambda_n$ in the principal diagonal and zero everywhere else.
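Let $k_1, \ldots, k_n$ be numbers for which

$$k_1\mathbf{t}_1 + k_2\mathbf{t}_2 + \ldots + k_n\mathbf{t}_n = 0. \tag{1}$$

Then, since $A\mathbf{t}_n = \lambda_n\mathbf{t}_n$,

$$A(k_1\mathbf{t}_1 + \ldots + k_{n-1}\mathbf{t}_{n-1}) = \lambda_n(k_1\mathbf{t}_1 + \ldots + k_{n-1}\mathbf{t}_{n-1}).$$

That is, since $A\mathbf{t}_r = \lambda_r\mathbf{t}_r$ for $r = 1, \ldots, n-1$,

$$k_1(\lambda_1 - \lambda_n)\mathbf{t}_1 + \ldots + k_{n-1}(\lambda_{n-1} - \lambda_n)\mathbf{t}_{n-1} = 0.$$

Since $\lambda_1, \ldots, \lambda_n$ are all different, this is

$$c_1k_1\mathbf{t}_1 + \ldots + c_{n-1}k_{n-1}\mathbf{t}_{n-1} = 0, \tag{2}$$

wherein all the numbers $c_1, \ldots, c_{n-1}$ are non-zero.

A repetition of the same argument with $n-1$ for $n$ gives

$$d_1k_1\mathbf{t}_1 + \ldots + d_{n-2}k_{n-2}\mathbf{t}_{n-2} = 0,$$

wherein $d_1, \ldots, d_{n-2}$ are non-zero. Further repetitions lead, step
by step, to

$$\alpha_1k_1\mathbf{t}_1 = 0, \qquad \alpha_1 \neq 0.$$

By hypothesis, $\mathbf{t}_1$ is a non-zero vector and therefore $k_1 = 0$.
We can repeat the same argument to show that a linear relation

$$k_2\mathbf{t}_2 + \ldots + k_n\mathbf{t}_n = 0$$

implies $k_2 = 0$; and so on until we obtain the result that (1) holds
only if

$$k_1 = k_2 = \ldots = k_n = 0.$$

This proves that $\mathbf{t}_1, \ldots, \mathbf{t}_n$ are linearly independent and, these
being the columns of $T$, the rank of $T$ is $n$ and $T$ is non-singular.

(ii) Moreover, the columns of $AT$ are

$$\lambda_1\mathbf{t}_1, \ldots, \lambda_n\mathbf{t}_n.$$

Hence† $AT = T \times \operatorname{diag}(\lambda_1, \ldots, \lambda_n)$
and it at once follows that

$$T^{-1}AT = \operatorname{diag}(\lambda_1, \ldots, \lambda_n).$$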
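
A matrix that is not symmetrical serves to illustrate the theorem. In the sketch below the matrix, its latent roots 2 and 5, and the latent vectors are assumptions chosen for the example, and `inverse_2x2` is a hypothetical helper valid only for matrices of order 2; exact rational arithmetic avoids rounding.

```python
from fractions import Fraction

def matmul(P, Q):
    """Product of two matrices given as lists of rows."""
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

def inverse_2x2(T):
    """Inverse of a non-singular 2x2 matrix, in exact fractions."""
    (a, b), (c, d) = T
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("T is singular")
    return [[d / det, -b / det], [-c / det, a / det]]

A = [[4, 1], [2, 3]]           # not symmetrical; latent roots 2 and 5
t1, t2 = [1, -2], [1, 1]       # latent vectors for 2 and 5 respectively
T = [[t1[0], t2[0]],
     [t1[1], t2[1]]]           # the columns of T are t1 and t2

D = matmul(inverse_2x2(T), matmul(A, T))
print(D)                        # expected to be diag(2, 5)
```

The latent vectors here are not orthogonal, so T is not an orthogonal matrix; the theorem needs only that T be non-singular.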

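4.2. When the latent roots are not all distinct. If $A$
is not symmetrical, it does not follow that a $k$-ple root of
$|A - \lambda I| = 0$ gives rise to $k$ linearly independent latent vectors.

† If necessary, work out the matrix product

$$\begin{bmatrix} t_{11} & t_{12} & t_{13} \\ t_{21} & t_{22} & t_{23} \\ t_{31} & t_{32} & t_{33} \end{bmatrix} \times \begin{bmatrix} \lambda_1 & 0 & 0 \\ 0 & \lambda_2 & 0 \\ 0 & 0 & \lambda_3 \end{bmatrix}.$$

If it so happens that $A$ has $n$ linearly independent latent vectors
$\mathbf{t}_1, \ldots, \mathbf{t}_n$ and $T$ has these vectors as columns, then, as in the proof
of the theorem,

$$T^{-1}AT = \operatorname{diag}(\lambda_1, \ldots, \lambda_n)$$

whether the $\lambda$'s are distinct or not.

On the other hand, not all square matrices of order $n$ have $n$
linearly independent latent vectors. We illustrate by simple
examples.

(i) Let

$$A = \begin{bmatrix} \alpha & 1 \\ 0 & \alpha \end{bmatrix}.$$

The characteristic equation is $(\alpha - \lambda)^2 = 0$ and a latent vector $\mathbf{x}$,
with components $x_1$ and $x_2$, must satisfy $A\mathbf{x} = \alpha\mathbf{x}$; that is,

$$\alpha x_1 + x_2 = \alpha x_1, \qquad \alpha x_2 = \alpha x_2.$$

From the first of these, $x_2 = 0$ and the only non-zero vectors
to satisfy $A\mathbf{x} = \alpha\mathbf{x}$ are numerical multiples of

$$\mathbf{e}_1 = \begin{bmatrix} 1 \\ 0 \end{bmatrix}.$$

(ii) Let

$$A = \begin{bmatrix} \alpha & 0 \\ 0 & \alpha \end{bmatrix}.$$

A latent vector $\mathbf{x}$ must again satisfy $A\mathbf{x} = \alpha\mathbf{x}$. This now requires
merely

$$\alpha x_1 = \alpha x_1, \qquad \alpha x_2 = \alpha x_2.$$

The two linearly independent vectors

$$\mathbf{e}_1 = \begin{bmatrix} 1 \\ 0 \end{bmatrix}, \qquad \mathbf{e}_2 = \begin{bmatrix} 0 \\ 1 \end{bmatrix}$$

and, indeed, all vectors of the form $x_1\mathbf{e}_1 + x_2\mathbf{e}_2$ are latent
vectors of $A$.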
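
The two examples can be compared by counting, for the root α, how many linearly independent latent vectors exist. The sketch below relies on a standard fact about homogeneous linear equations that is not stated in this section, namely that the equations (A - λI)x = 0 in n unknowns have n minus rank(A - λI) linearly independent solutions; the value α = 3 and the function names are assumptions for the illustration.

```python
from fractions import Fraction

def rank(rows):
    """Rank of a matrix (list of rows) by Gaussian elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in rows]
    rk = 0
    ncols = len(m[0]) if m else 0
    for col in range(ncols):
        pivot = next((i for i in range(rk, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for i in range(rk + 1, len(m)):
            factor = m[i][col] / m[rk][col]
            m[i] = [a - factor * b for a, b in zip(m[i], m[rk])]
        rk += 1
    return rk

def independent_latent_vectors(A, lam):
    """Number of linearly independent x with Ax = lam*x: n - rank(A - lam*I)."""
    n = len(A)
    shifted = [[A[i][j] - (lam if i == j else 0) for j in range(n)] for i in range(n)]
    return n - rank(shifted)

alpha = 3                                    # any value of alpha would do
first = [[alpha, 1], [0, alpha]]             # example (i)
second = [[alpha, 0], [0, alpha]]            # example (ii)
print(independent_latent_vectors(first, alpha))
print(independent_latent_vectors(second, alpha))
```

The first count is 1 and the second is 2, matching the conclusions reached above by solving the equations directly.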


5. Commutative matrices and latent vectors 

Two matrices A and B may or may not commute; they may 
or may not have common latent vectors. We conclude this 
chapter with two relatively simple theorems that connect the 
two possibilities. Throughout we take A and B to be square 
matrices of order n. 
5.1. Let $A$ and $B$ have $n$ linearly independent latent vectors
in common, say $\mathbf{t}_1, \ldots, \mathbf{t}_n$. Let $T$ be the square matrix having
these vectors as columns.

Then $\mathbf{t}_1, \ldots, \mathbf{t}_n$ are latent vectors of $A$ corresponding to latent
roots $\lambda_1, \ldots, \lambda_n$, say, of $A$; they are latent vectors of $B$ corresponding
to latent roots $\mu_1, \ldots, \mu_n$, say, of $B$. As in § 4.1 (ii), p. 210,

the columns of $AT$ are $\lambda_r\mathbf{t}_r$ $(r = 1, \ldots, n)$,
the columns of $BT$ are $\mu_r\mathbf{t}_r$ $(r = 1, \ldots, n)$

and, with the notation

$$L = \operatorname{diag}(\lambda_1, \ldots, \lambda_n), \qquad M = \operatorname{diag}(\mu_1, \ldots, \mu_n),$$

$$AT = TL, \qquad BT = TM.$$

That is

$$T^{-1}AT = L, \qquad T^{-1}BT = M.$$

It follows that

$$T^{-1}AT\,T^{-1}BT = LM = ML = T^{-1}BT\,T^{-1}AT,$$

so that $T^{-1}ABT = T^{-1}BAT$
and, on pre-multiplying by $T$ and post-multiplying by $T^{-1}$,

$$AB = BA.$$

We have accordingly proved

Theorem 64. Two matrices of order n with n linearly independent
latent vectors in common are commutative.
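
The theorem can be tried on two matrices manufactured to share their latent vectors. In this sketch the matrix T, the two sets of latent roots, and the helper `with_latent_vectors` (which forms T diag(roots) T⁻¹ for order 2 only) are all assumptions made for the illustration.

```python
from fractions import Fraction

def matmul(P, Q):
    """Product of two matrices given as lists of rows."""
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

def inverse_2x2(T):
    """Inverse of a non-singular 2x2 matrix, in exact fractions."""
    (a, b), (c, d) = T
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("T is singular")
    return [[d / det, -b / det], [-c / det, a / det]]

def with_latent_vectors(T, roots):
    """The matrix T diag(roots) T^{-1}; the columns of T are its latent vectors."""
    D = [[roots[i] if i == j else 0 for j in range(2)] for i in range(2)]
    return matmul(T, matmul(D, inverse_2x2(T)))

T = [[1, 1], [-2, 1]]                    # columns are the common latent vectors
A = with_latent_vectors(T, [2, 5])       # latent roots 2 and 5
B = with_latent_vectors(T, [7, -1])      # latent roots 7 and -1

AB = matmul(A, B)
BA = matmul(B, A)
print(AB == BA)
```

The comparison is exact because all arithmetic is rational; the same construction with a different T for B would in general produce matrices that do not commute.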

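5.2. Now suppose that $AB = BA$. Let $\mathbf{t}$ be a latent vector
of $A$ corresponding to a latent root $\lambda$ of $A$. Then $A\mathbf{t} = \lambda\mathbf{t}$ and

$$AB\mathbf{t} = BA\mathbf{t} = B\lambda\mathbf{t} = \lambda B\mathbf{t}. \tag{1}$$

When $B$ is non-singular $B\mathbf{t} \neq 0$ and is a latent vector of $A$
corresponding to $\lambda$. [$B\mathbf{t} = 0$ would imply $\mathbf{t} = B^{-1}B\mathbf{t} = 0$.]

If every latent vector of $A$ corresponding to $\lambda$ is a multiple of
$\mathbf{t}$, then $B\mathbf{t}$ is a multiple of $\mathbf{t}$, say

$$B\mathbf{t} = k\mathbf{t},$$

and $\mathbf{t}$ is a latent vector of $B$ corresponding to the latent root
$k$ of $B$.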

If there are $m$, but not more than $m$,† linearly independent
latent vectors of $A$ corresponding to the latent root $\lambda$, say

$$\mathbf{t}_1, \ldots, \mathbf{t}_m,$$

then, since $A\mathbf{t}_r = \lambda\mathbf{t}_r$ $(r = 1, \ldots, m)$,
provided that $k_1, \ldots, k_m$ are not all zero,

$$k_1\mathbf{t}_1 + \ldots + k_m\mathbf{t}_m \tag{2}$$

is a latent vector of $A$ corresponding to $\lambda$. By our hypothesis
that there are not more than $m$ such vectors which are linearly
independent, every latent vector of $A$ corresponding to $\lambda$ is of
the form (2).

† By § 1.4, $m \leq n$.

As in (1), $AB\mathbf{t}_r = \lambda B\mathbf{t}_r$ $(r = 1, \ldots, m)$
so that $B\mathbf{t}_r$ is a latent vector of $A$ corresponding to $\lambda$; it is therefore
of the form (2). Hence there are constants $k_{sr}$ such that

$$B\mathbf{t}_r = \sum_{s=1}^{m} k_{sr}\mathbf{t}_s \qquad (r = 1, \ldots, m).$$

For any constants $l_1, \ldots, l_m$,

$$B\sum_{r=1}^{m} l_r\mathbf{t}_r = \sum_{r=1}^{m} l_r\sum_{s=1}^{m} k_{sr}\mathbf{t}_s = \sum_{s=1}^{m}\Bigl(\sum_{r=1}^{m} l_rk_{sr}\Bigr)\mathbf{t}_s.$$

Let $\theta$ be a latent root of the matrix

$$\begin{bmatrix} k_{11} & \cdots & k_{1m} \\ \vdots & & \vdots \\ k_{m1} & \cdots & k_{mm} \end{bmatrix}.$$

Then there are numbers $l_1, \ldots, l_m$ (not all zero) for which

$$\sum_{r=1}^{m} l_rk_{sr} = \theta l_s \qquad (s = 1, \ldots, m)$$

and, with this choice of the $l_r$,

$$B\sum_{r=1}^{m} l_r\mathbf{t}_r = \theta\sum_{s=1}^{m} l_s\mathbf{t}_s.$$

Hence $\theta$ is a latent root of $B$ and $\sum l_r\mathbf{t}_r$ is a latent vector of $B$
corresponding to $\theta$; it is also a latent vector of $A$ corresponding
to $\lambda$. We have thus proved†

Theorem 65. Let A, B be square matrices of order n; let $|B| \neq 0$
and let $AB = BA$. Then to each distinct latent root $\lambda$ of A corresponds
at least one latent vector of A which is also a latent vector
of B.

† For a comprehensive treatment of commutative matrices and their
common latent vectors see S. N. Afriat, Quart. J. Math. (2) 5 (1954) 82-85.
Examples XV


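1. Referred to rectangular axes $Oxyz$ the direction cosines of $OX$, $OY$,
$OZ$ are $(l_1, m_1, n_1)$, $(l_2, m_2, n_2)$, $(l_3, m_3, n_3)$. The vectors $OX$, $OY$, $OZ$ are
mutually orthogonal. Show that

$$\begin{bmatrix} l_1 & m_1 & n_1 \\ l_2 & m_2 & n_2 \\ l_3 & m_3 & n_3 \end{bmatrix}$$

is an orthogonal matrix.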

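2. Four vectors have elements

$$(a, a, a, a), \quad (b, -b, b, -b), \quad (c, 0, -c, 0), \quad (0, d, 0, -d).$$

Verify that they are mutually orthogonal and find positive values of
$a, b, c, d$ that make these vectors columns of an orthogonal matrix.

3. The vectors $\mathbf{x}$ and $\mathbf{y}$ are orthogonal and $T$ is an orthogonal matrix.
Prove that $\mathbf{X} = T\mathbf{x}$ and $\mathbf{Y} = T\mathbf{y}$ are orthogonal vectors.

4. The vectors $\mathbf{a}$ and $\mathbf{b}$ have components (1, 2, 3, 4) and (2, 3, 4, 5).
Find $k_1$ so that

$$\mathbf{b}_1 = \mathbf{b} + k_1\mathbf{a}$$

is orthogonal to $\mathbf{a}$.

Given $\mathbf{c} = (1, 0, 0, 1)$ and $\mathbf{d} = (0, 0, 1, 0)$, find constants $l_1, m_1, l_2, m_2, n_2$
for which, when $\mathbf{c}_1 = \mathbf{c} + l_1\mathbf{b}_1 + m_1\mathbf{a}$, $\mathbf{d}_1 = \mathbf{d} + l_2\mathbf{c}_1 + m_2\mathbf{b}_1 + n_2\mathbf{a}$, the
four vectors $\mathbf{a}, \mathbf{b}_1, \mathbf{c}_1, \mathbf{d}_1$ are mutually orthogonal.

5. The columns of a square matrix $A$ are four vectors $\mathbf{a}_1, \ldots, \mathbf{a}_4$ and

$$A'A = \operatorname{diag}(\alpha_1^2, \alpha_2^2, \alpha_3^2, \alpha_4^2).$$

Show that the matrix whose columns are

$$\alpha_1^{-1}\mathbf{a}_1, \ldots, \alpha_4^{-1}\mathbf{a}_4$$

is an orthogonal matrix.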

6. Prove that the vectors with elements

(1, -2, 3), (0, 1, -2), (0, 0, 1)

are latent vectors of the matrix whose columns are

(1, 2, -2), (0, 2, 2), (0, 0, 3).

7. Find the latent roots and latent vectors of the matrices

$$\begin{bmatrix} 1 & 0 & 0 \\ 2 & 1 & 0 \\ -2 & 2 & 3 \end{bmatrix}, \qquad \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ -2 & 2 & 3 \end{bmatrix},$$

showing (i) that the first has only two linearly independent latent vectors;
(ii) that the second has three linearly independent latent vectors
$\mathbf{l}_1 = (1, 0, 1)$, $\mathbf{l}_2 = (0, 1, -1)$, $\mathbf{l}_3 = (0, 0, 1)$ and that $k_1\mathbf{l}_1 + k_2\mathbf{l}_2$ is a
latent vector for any values of the constants $k_1$ and $k_2$.

8. Prove that, if $\mathbf{x}$ is a latent vector of $A$ corresponding to a latent root
$\lambda$ of $A$ and $C = TAT^{-1}$, then

$$CT\mathbf{x} = TA\mathbf{x} = \lambda T\mathbf{x}$$

and $T\mathbf{x}$ is a latent vector of $C$ corresponding to a latent root $\lambda$ of $C$. Show
also that, if $\mathbf{x}$ and $\mathbf{y}$ are linearly independent, so also are $T\mathbf{x}$ and $T\mathbf{y}$.
9. Find three unit latent vectors of the symmetrical matrix

$$\begin{bmatrix} 2 & 0 & 0 \\ 0 & -3 & 1 \\ 0 & 1 & -3 \end{bmatrix}$$

in the form 


and deduce the orthogonal transformation whereby

$$2x^2 - 3y^2 - 3z^2 + 2yz = 2X^2 - 2Y^2 - 4Z^2.$$

10. (Harder †) Find an orthogonal transformation from variables
x, y, z to variables X, Y, Z whereby

$$2yz + 2zx + 2xy = 2X^2 - Y^2 - Z^2.$$


11. Prove that the latent roots of the discriminant of the quadratic form

$$2yz + 2zx + 2xy$$

are $-1, -1, 2$.

Prove that, corresponding to $\lambda = -1$,

(i) $(x, y, z)$ is a latent vector whenever $x + y + z = 0$;

(ii) $(1, -1, 0)$ and $(1, 0, -1)$ are linearly independent and that
$(1, -1, 0)$ and $(1 + k, -k, -1)$ are orthogonal when $k = -\tfrac{1}{2}$;

(iii) $$a = \left(\frac{1}{\sqrt{2}}, -\frac{1}{\sqrt{2}}, 0\right), \qquad b = \left(\frac{1}{\sqrt{6}}, \frac{1}{\sqrt{6}}, -\frac{2}{\sqrt{6}}\right)$$

are orthogonal unit latent vectors.

Prove that a unit latent vector corresponding to $\lambda = 2$ is

$$c = \left(\frac{1}{\sqrt{3}}, \frac{1}{\sqrt{3}}, \frac{1}{\sqrt{3}}\right).$$

Verify that when T is the matrix having a, b, c as columns and x = TX,
where $x = (x, y, z)$ and $X = (X, Y, Z)$,

$$2yz + 2zx + 2xy = -X^2 - Y^2 + 2Z^2.$$

12. Prove that x = Z, y = Y, z = X is an orthogonal transformation.
With n variables $x_1, \ldots, x_n$, show that

$$x_1 = X_\alpha, \quad x_2 = X_\beta, \quad \ldots, \quad x_n = X_\kappa,$$

where $\alpha, \beta, \ldots, \kappa$ is a permutation of $1, 2, \ldots, n$, is an orthogonal transformation.
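
For a particular permutation the statement of Example 12 can be tested by forming $T'T$. The following Python sketch is an illustration under stated assumptions: the names `permutation_matrix` and `gram` are introduced here, and the suffixes are numbered from 0 rather than from 1.

```python
def permutation_matrix(perm):
    """Matrix T of the transformation x_i = X_perm[i] (suffixes from 0):
    row i has a single 1, in column perm[i]."""
    n = len(perm)
    return [[1 if j == perm[i] else 0 for j in range(n)] for i in range(n)]

def gram(T):
    """Return T'T, whose (i, j) element is the scalar product of columns i and j."""
    n = len(T)
    return [[sum(T[k][i] * T[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

T = permutation_matrix([2, 1, 0])   # x = Z, y = Y, z = X
print(gram(T) == identity(3))
```

Such a check covers only the permutation tried; the general proof is the one asked for in the example.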

13. Find an orthogonal transformation x = TX which gives

$$2yz + 2zx = (Y^2 - Z^2)\sqrt{2}.$$

† Example 11 is the same sum broken down into a step-by-step solution.

14. When $r = 1, 2, 3$, $s = 2, 3, 4$ and

$$x'Ax = 2 \sum_{r<s} x_r x_s,$$

prove that the latent roots of A are $-1, -1, -1, 3$ and that

$(1, 0, 0, -1)$, $(0, 1, -1, 0)$, $(1, -1, -1, 1)$, $(1, 1, 1, 1)$

are mutually orthogonal latent vectors of A.

Find the matrix T whose columns are unit vectors parallel to the above
(i.e. numerical multiples of them) and show that, when x = TX,

$$x'Ax = -X_1^2 - X_2^2 - X_3^2 + 3X_4^2.$$
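
The reduction in Example 14 can be checked numerically by forming $T'AT$. The sketch below, in Python, is an illustration and not part of the original text; the helper names `unit`, `transpose` and `mat_mul` are assumptions, and floating-point arithmetic is used, so the comparison with the diagonal matrix is made to within a small tolerance.

```python
import math

n = 4
# Matrix of the form 2 * sum_{r<s} x_r x_s: zeros on the diagonal, ones elsewhere.
A = [[0 if i == j else 1 for j in range(n)] for i in range(n)]

vectors = [(1, 0, 0, -1), (0, 1, -1, 0), (1, -1, -1, 1), (1, 1, 1, 1)]

def unit(v):
    """Numerical multiple of v having unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def transpose(M):
    return [list(row) for row in zip(*M)]

def mat_mul(P, Q):
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

T = transpose([unit(v) for v in vectors])   # unit latent vectors as columns
D = mat_mul(mat_mul(transpose(T), A), T)    # T'AT

expected = [[-1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, 3]]
ok = all(abs(D[i][j] - expected[i][j]) < 1e-12
         for i in range(n) for j in range(n))
print(ok)
```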

15. Show that there is an orthogonal transformation † x = TX whereby

$$2x_1x_2 + 2x_1x_4 - 4x_2x_3 + 4x_3x_4 = \sqrt{2}\,(X_1^2 + 2X_2^2 - 2X_3^2 - X_4^2).$$

16. A square matrix A, of order n, has n mutually orthogonal unit 
latent vectors. Prove that A is symmetrical. 

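17. Find the latent vectors of the matrices

$$\begin{bmatrix} 1 & 6 & 1 \\ 1 & 2 & 0 \\ 0 & 0 & 3 \end{bmatrix}, \qquad \begin{bmatrix} -2 & -1 & -5 \\ 1 & 2 & 1 \\ 3 & 1 & 6 \end{bmatrix}.$$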


18. The matrix A has latent vectors

$(1, 0, 0)$, $(1, 1, 0)$, $(1, 2, 3)$

corresponding to the latent roots $\lambda = 1, 0, -1$. The matrix B has the
same latent vectors corresponding to the latent roots $\mu = 1, 2, 3$. Find
the elements of A and B and verify that AB = BA.

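19. The square matrices A and B are of order n; A has n distinct latent
roots $\lambda_1, \ldots, \lambda_n$ and B has non-zero latent roots $\mu_1, \ldots, \mu_n$, not necessarily
distinct; and AB = BA. Prove that there is a matrix T for which

$$T^{-1}AT = \operatorname{diag}(\lambda_1, \ldots, \lambda_n), \qquad T^{-1}BT = \operatorname{diag}(\mu_1, \ldots, \mu_n).$$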


Hints and Answers 

2. $a = b = \tfrac{1}{2}$, $c = d = 1/\sqrt{2}$; these give unit vectors.

3. Work out X'Y. 

4. $k_1 = -\tfrac{4}{3}$; $l_1 = -\tfrac{1}{2}$, $m_1 = -\tfrac{1}{6}$, $l_2 = \tfrac{1}{2}$, $m_2 = 0$, $n_2 = -\tfrac{1}{10}$; cf. § 3.2.

6. $\lambda = 1, 2, 3$.

7. (i) $\lambda = 1, 1, 3$; vectors $(0, 1, -1)$, $(0, 0, 1)$.

9. $\lambda = 2, -2, -4$; use Theorem 62.

12. x = TX gives $x'Ix = X'T'ITX = X'T'TX$. The given transformation
gives $\sum x^2 = \sum X^2$; therefore $T'T = I$.

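13. $\lambda = 0, \pm\sqrt{2}$; unit latent vectors are

$$\left(\frac{1}{\sqrt{2}}, -\frac{1}{\sqrt{2}}, 0\right), \quad \left(\frac{1}{2}, \frac{1}{2}, \frac{1}{\sqrt{2}}\right), \quad \left(\frac{1}{2}, \frac{1}{2}, -\frac{1}{\sqrt{2}}\right).$$

Use Theorem 62.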

† The actual transformation is not asked for: too tedious to be worth the
pains.
16. Let $t_1, \ldots, t_n$ be the vectors and T have the t's as columns.

As in § 4.1, $T^{-1}AT = \operatorname{diag}(\lambda_1, \ldots, \lambda_n)$. But $T^{-1} = T'$ and so

$$T'AT = \operatorname{diag}(\lambda_1, \ldots, \lambda_n).$$

A diagonal matrix is its own transpose and so

$$T'AT = (T'AT)' = T'A'T \quad \text{and} \quad A = A'.$$


17. (i) $\lambda = -1, 3, 4$; vectors $(3, -1, 0)$, $(1, 1, -4)$, $(2, 1, 0)$.

(ii) $\lambda = 1, 2, 3$; vectors $(2, -1, -1)$, $(1, 1, -1)$, $(1, 0, -1)$.

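18. $$A = \begin{bmatrix} 1 & -1 & 0 \\ 0 & 0 & -\tfrac{2}{3} \\ 0 & 0 & -1 \end{bmatrix}, \qquad B = \begin{bmatrix} 1 & 1 & 0 \\ 0 & 2 & \tfrac{2}{3} \\ 0 & 0 & 3 \end{bmatrix}.$$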
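
The answer to Example 18 can be verified by direct multiplication. The following Python sketch is an illustration only; the helper names `mat_vec` and `mat_mul` are assumptions, and exact fractions are used so that the equalities hold without rounding.

```python
from fractions import Fraction as F

A = [[1, -1, 0],
     [0, 0, F(-2, 3)],
     [0, 0, -1]]
B = [[1, 1, 0],
     [0, 2, F(2, 3)],
     [0, 0, 3]]

def mat_vec(M, v):
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def mat_mul(P, Q):
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

vectors = [(1, 0, 0), (1, 1, 0), (1, 2, 3)]
roots_A = [1, 0, -1]
roots_B = [1, 2, 3]

# Each vector v must satisfy A v = lam v and B v = mu v.
latent_ok = all(
    mat_vec(A, v) == [lam * x for x in v] and
    mat_vec(B, v) == [mu * x for x in v]
    for v, lam, mu in zip(vectors, roots_A, roots_B)
)
commute = mat_mul(A, B) == mat_mul(B, A)
print(latent_ok, commute)
```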


19. A has n linearly independent latent vectors (Theorem 63), which
are also latent vectors of B (Theorem 65). Finish as in proof of Theorem
64.